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We present a measurement of the ti production cross section using events with one charged lepton 
and jets from pp collisions at a center-of-mass energy of 1.96 TeV. In these events, heavy flavor quarks 
from top quark decay are identified with a secondary vertex tagging algorithm. From 162 pb -1 of 
data collected by the Collider Detector at Fermilab, a total of 48 candidate events are selected, 
where 13.5 ± 1.8 events are expected from background contributions. We measure a ti production 
cross section of 5.61^ \ (stat.)^°'g(syst.) pb. 



PACS numbers: 13.85.Ni, 13.85.Qk, 14.65.Ha 



I. INTRODUCTION 



The top quark is pair-produced in pp collisions through 
quark-antiquark annihilation and gluon-gluon fusion. 
The measurement of the ti cross section tests the QCD 
calculations for the pair production of a massive colored 
triplet. These calculations have been performed in per- 
turbation theory to next-to- leading order 0, Q . Recent 
work on corrections for soft gluon emission show that 
their effect on the cross section is small, and that they 
reduce the theoretical uncertainty arising from the choice 



of renormalization and factorization scales to less than 
5% over the expected range of top masses and parton 
distribution functions (PDFs). The leading theoretical 
uncertainties are in the PDFs, arising mostly from the 
understanding of the gluon distributions at large par- 
ton x. The total theoretical uncertainty is approximately 
15% 0. At ^=1.96 TeV, the predicted ti production 
cross section is a t t = 6.7+p'g pb at m t — 175 GeV/c 2 @|. 
For every 1 GeV/c 2 increase in the top mass over the 
interval 170 < m t < 190GeV/c 2 , the ti cross section 
decreases by 0.2 pb. 

The Standard Model top quark decays to a W bo- 
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son and a b quark almost 100% of the time. Top quark 
pair production thus gives rise to two W bosons and 
two "b jets" from b quark fragmentation. When exactly 
one W decays leptonically, the ti event typically con- 
tains a high transverse momentum lepton, missing trans- 
verse energy from the undetected neutrino, and four high 
transverse momentum jets, two of which originate from 
b quarks. This mode is labelled a W plus jets" or "lepton 
plus jets." Since the final state branching ratio is di- 
rectly related to the W branching ratios, the ti rate into 
a particular final state measures both the production and 
decay properties of the top quark. An unexpected result 
could thus indicate either a non-standard source of top- 
like events, or a modification of the top decay branching 
ratios. 

The pp collisions for this measurement of ti production 
were produced during Run II of the Fermilab Tevatron. 
The data were recorded at CDF II, a general purpose 
detector which combines charged particle tracking, sam- 
pling calorimeters, and fine-grained muon detection. Iso- 
lating the lepton plus jets decay mode of the top quark 
builds on the detailed understanding of inclusive lep- 
tonic W boson decays in CDF II 3J. The ti signature 
is mimicked by processes in which a W boson is pro- 
duced in association with several hadronic jets with large 
transverse momentum. To separate the ti events from 
this background we use precision silicon tracking to b- 
tag jets containing a secondary vertex from a b hadron 
decay. Background contributions from fake Ws, mis- 
identified secondary vertices and heavy flavor production 
processes such as Wbb are estimated using a combination 
of Monte Carlo calculations and independent measure- 
ments in control data samples. An excess in the number 
of events which contain a lepton, missing energy, and 
three or more jets with at least one 6-tag is the signal of 
ti production and is used to measure the production cross 
section a t i- The dataset defined by this analysis forms 
the basis for other measurements of top quark properties, 
such as the top quark mass and the helicity of W bosons 
produced in top decays. 

This measurement builds on the 6-tagging techniques 
employed by CDF at the Tevatron Run I. Then, at 
-yi=1.8 TeV, a similar analysis of lepton+jets events 
with &-tags gave a ti cross section of o t t = 5.1 ± 1.5 
pb 0, compared to an expected value of a t i — 5.2+g'^ 
pb at m t = 175GeV/c 2 0. Here, using a larger dataset 
collected at higher center-of-mass energy as well as im- 
proved Monte Carlo tools and detector simulations, we 
have re-analyzed the heavy flavor fraction in W events 
and improved our understanding of 6-tagging efficiencies, 
including the contribution of material interactions to fake 
b tags. In addition, the significance of the measurement 
is optimized by requiring a large scalar sum of the trans- 
verse energies of all objects in the event (Ht), which 
improves the rejection of background events. 

Our analysis complements other recent ti cross section 
determinations at CDF II using dilepton events p| or 
using lepton plus jets events with 6-tags and a kinemat- 



ically derived estimate of the 6-tagged backgrounds [||. 
The work of Reference |(| is particularly relevant to the 
measurement described here, in that it uses the same 
b-tagged event sample, but calculates the backgrounds 
by appealing to a data control sample available only in 
the W plus jets selection. Our technique for background 
estimation is significantly more general, and this paper 
establishes the ability to use b-tagging in many other 
kinds of measurements at CDF in the future. We com- 
ment further on this matter at the end of Section III. 

The organization of this paper is as follows. Sec. [Q] 
reviews the detector systems and event reconstruction 
techniques relevant to this measurement. The trigger 
and sample selections are described in Sec. IHII The b- 
tagging algorithm, its efficiency for tagging & jets, and the 
understanding of its fake rate are discussed in Sec. IIVI 
The means for estimating backgrounds from processes 
which produce a W in association with heavy flavor are 
described in Sec. In Sec. IVII our understanding of 
mistags and backgrounds is applied to collate a compre- 
hensive estimate of all tagged contributions to the lepton 
+ jets sample, and this estimate is compared with the 
data. A cross-check of the background estimation, using 
the Z + jets sample, is presented in Sec. lVIII In Sec. lVIIll 
an optimization using the total transverse energy in the 
event to improve the cross section measurement uncer- 
tainty is described, along with the acceptance associated 
with this event selection. The ti production cross section 
measured in events with at least one &-tagged jet is pre- 
sented in Sec. IIXI the result in events with at least two 
6-tagged jets is presented in Sec.|X] The final results are 
summarized in Sec. IXII 



II. EVENT DETECTION AND 
RECONSTRUCTION 

The CDF II detector is described using a cylindri- 
cal coordinate system with the z coordinate along the 
proton direction, the azimuthal angle <fi, and the po- 
lar angle 6 usually expressed through the pseudorapid- 
ity r\ = — ln(tan(0/2)). The rectangular coordinates x 
and y point radially outward and vertically upward from 
the Tevatron ring, respectively. The detector is approxi- 
mately symmetric in r/ and 4>- 



A. Charged Particle Tracking 

Drift cell and silicon microstrip systems provide 
charged particle tracking information in the region < 
1.0 and I77I < 2.0, respectively. The tracking systems are 
contained in a 3.2 m diameter, 5 m long superconducting 
solenoid which produces a 1.4 T magnetic field aligned 
coaxially with the pp beams, allowing measurement of 
charged particle momentum transverse to the beamline 
(.Pt)- 

The Central Outer Tracker (COT) is a 3.1 m long open 
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cell drift chamber which performs 96 track measurements 
in the region between 0.40 and 1.37 m from the beam 
axis Q. Sense wires are arranged in 8 alternating axial 
and ±2° stereo "superlayers" with 12 wires each. The 
position resolution of a single drift time measurement is 
approximately 140 /im. 

Charged particle trajectories are found first as a series 
of approximate line segments in the individual axial su- 
perlayers. Two complementary algorithms are used to 
associate segments lying on a common circle, and the re- 
sults are merged to yield a final set of axial tracks. Track 
segments in the stereo superlayers are associated with 
axial track segments to reconstruct tracks in three di- 
mensions. COT tracks used in this analysis are required 
to have at least 3 axial and 3 stereo superlayers with 7 
hits per superlayer. 

The efficiency for finding isolated high momentum 
tracks is measured using electrons from W — > e^v which 
are identified in the central region \r\\ < 1.1 using only the 
calorimetric information for the electron shower and the 
missing transverse energy (see below). In these events, 
the efficiency for finding the electron track is found to be 
99.93^Q 35%, and this is typical for high momentum iso- 
lated tracks from either electronic or muonic W decays 
which are contained in the COT. For high-momentum 
tracks, the transverse momentum resolution is found to 
be Spt/pt ~ 0.1% ■ pT(GeV), the track position resolu- 
tion at the origin is Sz w 0.5 cm in the direction along the 
beamline and the resolution on the track impact param- 
eter, or distance from the beamline at the track's closest 
approach in the transverse plane, is Sdo rs 350/im. 

A road-based hardware pattern recognition algorithm 
runs online in the eXtremely Fast Tracker (XFT) to pro- 
vide track information for triggering || . Drift times par- 
titioned into two time bins are used to find the axial seg- 
ments which are matched in their positions and slopes. 
An "XFT track" is one which has four matching axial 
segments on a trajectory. The XFT efficiency is mea- 
sured in a set of well measured COT tracks which pass 
through all 4 axial superlayers. The XFT is found to have 
an average efficiency of 96.7 ± 0.1% for charged particles 
with momenta greater than 25 GeV/c. 

Inside the inner radius of the COT, a five layer double- 
sided silicon microstrip detector (SVX) covers the region 
between 2.5 to 11 cm from the beam axis Three 
separate SVX barrel modules are juxtaposed along the 
beamline to cover a length of 96 cm, approximately 90% 
of the luminous beam intersection region. Three of the 
five layers combine an r — <f> measurement on one side and 
a 90° stereo measurement on the other, and the remain- 
ing two layers combine r — <f> with small angle stereo at 
±1.2°. The typical silicon hit resolution is 11 /im. Addi- 
tional Intermediate Silicon Layers (ISL) at radii between 
19 and 30 cm in the central region link tracks in the COT 
to hits in SVX. 

Silicon hit information is added to reconstructed COT 
tracks using a progressive "Outside-In" (OI) tracking al- 
gorithm. COT tracks are extrapolated into the silicon 



detector, associated silicon hits are found, and the track 
is refit with the added information of the silicon mea- 
surements. The initial track parameters provide a width 
for a search road in a given layer. Then, for each can- 
didate hit in that layer, the track is refit and used to 
define the search road into the next layer. The stepwise 
addition of the precision SVX information at each layer 
progressively reduces the size of the search road, while 
also properly accounting for the additional uncertainty 
due to multiple scattering in each layer. The search uses 
the two best candidate hits in each layer to generate a 
small tree of final track candidates, from which the tracks 
with the best x 2 are selected. The efficiency for associat- 
ing at least three silicon hits with an isolated COT track 
is 91 ± 1%. The extrapolated impact parameter resolu- 
tion for high momentum OI tracks is 30 /im, including 
the uncertainty in the beam position. 



B. Calorimetry for Electrons and Jets 

Outside of the tracking systems and the solenoid, 
segmented calorimeters with projective geometry are 
used to reconstruct electromagnetic (EM) showers and 
jets 0, IHl- The EM and hadronic calorimeters 
are lead-scintillator and iron-scintillator sampling de- 
vices, respectively. The calorimeter is segmented into 
"towers" , each covering a small range of pseudo-rapidity 
and azimuth; the full array covers 27r azimuth over the 
pseudo-rapidity range \rj\ < 3.6. The transverse energy 
Et — E sin 9 is measured in each calorimeter tower, 
where the polar angle is calculated using the measured z 
position of the event vertex. Proportional and scintillat- 
ing strip detectors measure the transverse profile of EM 
showers at a depth corresponding to the shower maxi- 
mum. 

High momentum jets, photons, and electrons leave 
isolated energy deposits in small contiguous groups of 
calorimeter towers which can be identified and summed 
together into an energy "cluster." For the purpose of trig- 
gering, online processors organize the calorimeter tower 
information into separate lists of clusters for the electro- 
magnetic compartments alone and for the electromag- 
netic and hadronic compartments combined. Electrons 
are identified in the central electromagnetic calorimeter 
(CEM) as isolated, mostly electromagnetic clusters which 
match with an XFT track, in the pseudorapidity range 

M<i.i. 

The electron transverse energy is reconstructed from 
the electromagnetic cluster with a precision <j(Et) I Et = 
13.5%/V£W(GeV) © 2% 0. Jets are identified as 
a group of electromagnetic and hadronic calorimeter 
cl usters whic h fall within a cone of radius AR = 
\J A<fi 2 + At/ 2 < 0.4 [lj|. Jet energies are corrected for 
calorimeter non-linearity, losses in the gaps between tow- 
ers |l5j . and multiple primary interactions. The jet en- 
ergy resolution is approximately (0.1(E T /(GeV)) + 1.0) 
GeV EfJ. 
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C. Muon Detection and Reconstruction 

For this analysis, muons are detected in three sepa- 
rate subdetectors. Directly outside of the calorimeter, 
four-layer stacks of planar drift chambers (CMU) detect 
muons with px > 1.4 GeV/c which pen etrate the five 
absorption lengths of the calorimeter [17]]. Farther out, 
behind another 60 cm of steel, an additional four layers 
(CMP) detect muons with p T > 2.0 GeV/c 18] . The 
two systems cover the same part of the central region 
|r/| < 0.6, although the CMU and CMP have different 
structures and their geometrical coverages do not over- 
lap exactly. Muons between 0.6 < \rj\ < 1.0 pass through 
at least four drift layers lying on a conic section out- 
side of the central calorimeter; this system (CMX) com- 
pletes the coverage over the full fiducial region of the 
COT tracker 0. The presence of a penetrating muon 
is reconstructed as a line segment or "stub" in one of the 
four-layer stacks. Muon candidates are then identified as 
isolated tracks which extrapolate to the stubs. A track 
which is linked to both CMU and CMP stubs is called a 
CMUP muon. 



D. Beam Positions and the Primary Interaction 
Vertex 

The event selection depends on reconstructing sec- 
ondary vertices from b hadron decays. The identification 
of these decay vertices requires a precise measurement 
of the primary vertex, the point from which all prompt 
tracks originate. The primary vertex location in a given 
event can be found by fitting well-measured tracks to a 
common point of origin. 

The locus of all primary vertices defines the "beam- 
line," the position of the luminous region of the beam- 
beam collisions through the detector. The beamline can 
be used as a constraint to refine the knowledge of the pri- 
mary vertex in a given event. The first estimate of the 
primary vertices (xy , yy, zy) is binned in the z coordi- 
nate. A linear fit to (xy,yy) vs. zy yields the beamline 
of each run section. 

The luminous region is long, with a z = 29 cm. The 
transverse cross section is circular, with a width of ap- 
proximately 30 fim at z = 0, rising to « 50 — 60 /im at \z\ 
= 40 cm. The beam is neither parallel to nor centered 
in the detector. At z = 0, the beamline is at (xy, yy) « 
(—2.0,3.9) mm, and has a slope of ss 5.0//m/cm in the 
horizontal plane and ~ 1.7/im/cm in the vertical plane. 
These parameters are rather stable, varying from their 
mean positions by no more than 20% during periods 
of continuous data taking. 

At high luminosities, more than one collision can oc- 
cur on a given bunch crossing; the primary vertices of 
the collision are typically separated in the z coordinate. 
For the data analyzed here, there are an average of 1.4 
reconstructed vertices per event. The z position of each 
vertex is calculated from the weighted average of the z 



coordinates of all tracks within 1 cm of a first iteration 
vertex, with a typical resolution of 100 /im. 

A final determination uses all of the information above 
to recalculate a best primary vertex in each candidate 
event for the 6-tagging procedure. This precise calcu- 
lation, using a beam constraint and OI tracks, is de- 
scribed fully in Sec. I1VI As part of the lepton + jets 
event selection, the events are required to have the re- 
constructed primary vertex located inside the luminous 
region (\z\ < 60 cm). 



III. DATA SAMPLES AND EVENT SELECTION 
A. Colliding Beam Data 

The colliding beam data used in this analysis were 
recorded during the period March 2002 - August 2003, 
when the instantaneous Tevatron luminosity ranged from 
0.5 to 4.0 xlO 31 cm" 2 s- 1 . 

Cherenkov light detectors in the very forward region 
(| 77 1 > 3.7) record information on the instantaneous and 
total integrated luminosity of the Tevatron [Pj . The to- 
tal integrated luminosity for this period is 193 ± 12 pb _1 ; 
after quality requirements on the silicon tracking, the 
data sample used for this analysis amounts to 162 ± 
10 pb' 1 for CEM electrons and CMUP muons, and 
150 ± 9 pb _1 for CMX muons. 

For the primary data samples used in this analysis, the 
detector is triggered on high momentum electrons and 
muons. The electron hardware triggers require an XFT 
track with pr > 8 GeV/c matched to an EM cluster with 
Et > 16 GeV and the ratio of hadronic to electromag- 
netic energy less than 0.125. The muon hardware trig- 
gers require an XFT track with pt > 8 GeV/c matched 
to muon stubs in the joint CMUP configuration or in 
the CMX. A complete version of the offline lepton selec- 
tion is performed online in the last stage of triggering, 
and repeated in offline processing with updated calibra- 
tion constants. Other secondary datasets described in 
Sec. IIVI use a jet trigger with a certain Et threshold or 
an electron trigger with relaxed Et requirements. 



B. Monte Carlo Samples 

The understanding of acceptances, efficiencies, and 
backgrounds relies on detailed simulation of physics pro- 
cesses and the detector response. Most measurements 
of acceptance and efficiency rely on pythia v6.2 [2(j or 
herwig v6.4 0, These generators employ lead- 

ing order matrix elements for the hard parton scattering, 
followed by parton showering to simulate gluon radiation 
and fragmentation. Each generator is used in conjunction 
with the CTEQ5L parton distribution functions [23|. For 
heavy flavor jets, we interface to QQ v9.1 [24( to provide 
proper modeling of b and c hadron decays. 
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The estimate of the ^-tagging backgrounds due to 
higher order QCD processes such as Wbb requires spe- 
cial care. This study of backgrounds in the 6-tagged 
sample uses the ALPGEN program [2^, which generates 
high multiplicity partonic final states using exact leading- 
order matrix elements. The parton level events are then 
passed to herwig and QQ for parton showering and b 
and c hadron decay. Further discussion of ALPGEN can 
be found in Sec. [V] 

The CDF II detector simulation reproduces the re- 
sponse of the detector to particles produced in pp col- 
lisions. The same detector geometry database is used 
in both the simulation and the reconstruction, and 
tracking of particles through matter is performed with 
GEANT3 [26j . Charge deposition in the silicon detectors is 
calculated using a simple geometrical model based on the 
path length of the ionizing particle and an unrestricted 
Landau distribution. The drift model for the COT uses a 
parameterization of a Garfield simulation, with the pa- 
rameters tuned to match COT data Q. The calorimeter 
simulation uses the GFLASH [27| parameterization pack- 
age interfaced with GEANT3. The GFLASH parameters 
are tuned to test beam data for electrons and high-p^ 
pions, and they are checked by comparing the calorime- 
ter energy of isolated tracks in the collision data to their 
momenta as measured in the COT. Further detail on the 
CDF II simulation can be found elsewhere |2Sj . 



C. W + Jets Selection 

The selection identifies events consistent with the W 
+ jets signature containing a high-momentum electron or 
muon (hereafter referred to as "lepton," £), large missing 
transverse energy, and hadronic jets. The event selection 
is summarized below. 

The offline electron selection requires an EM cluster 
with Et > 20 GeV matched to a track with pt > 
lOGeV/c. The cluster is required to have an electro- 
magnetic fraction and shower shape consistent with an 
electron deposit. The extrapolated track is required to 
match the shower location as measured in the shower 
maximum strip detector, and to have a momentum con- 
sistent with the shower energy. Finally, since the electron 
from W decay is expected to be isolated from other en- 
ergy deposits in the calorimeter, the energy in a cone of 
radius Ai? = 0.4 around the electron cluster, but not 
including the cluster itself, is measured, and the isola- 
tion ratio of the energy in the cone to the energy of the 
electron is required to be less than 0.1. 

Photon conversions in the detector material are a 
source of electron backgrounds. A conversion is defined 
as a pair of tracks (one of them the electron) satisfying 
the following cuts: 

• oppositely charged, 

• |A(xy)| < 2 mm, and 



• |A(cot0)| < 0.04, 

where A(xy) is the distance between the tracks in the r — 
<t> plane at the point where they are parallel in that plane, 
and A(cot0) is the difference between the cotangents of 
the polar angles of the two tracks. Electrons that are 
part of an identified conversion pair are not considered 
further in the electron selection. 

The offline muon selection requires a COT track with 
p T > 20 GeV/c matched to a CMUP or CMX muon stub. 
The matching is based on the extrapolated track position 
at the chambers, accounting for the effects of multiple 
scattering. The energy in the calorimeter tower contain- 
ing the muon is required to be consistent with the depo- 
sition expected from a minimum ionizing particle. Back- 
grounds from cosmic rays are removed by requiring that 
the track extrapolates to the origin, and that the min- 
imum ionizing tower energy deposit is within a narrow 
timing window around the beam crossing. 

In these high momentum lepton samples, the signal of 
the neutrino from W — > Iv is large missing transverse 
energy, $ T . The $ T is calculated as the vector sum of 
the energy in each calorimeter tower multiplied by the 
azimuthal direction of the tower. If isolated high momen- 
tum muons are found in the event, the $ T is corrected 
by subtracting the muon energy in the calorimeter and 
adding the muon px to the vector sum. The selection 
finally requires $ T > 20 GeV. 

In addition to the direct t — > ev e b and t — > [iv^b modes, 
this event selection has a small acceptance for top final 
states with W — > tv and a subsequent leptonic r decay, 
or with high momentum semi- leptonic b quark decays. 
These are included in the signal acceptances calculated 
in Sec. IX. 

Z bosons and top dilepton decays that contribute to 
the inclusive high px lepton dataset are removed by flag- 
ging the presence of a second lepton. Any event with two 
leptons satisfying the lepton identification is removed, as 
well as those events where the second lepton is an elec- 
tron in the plug calorimeter or a muon that fails the 
CMUP requirement, but has one CMU or CMP muon 
segment. Finally, we attempt to remove Z bosons with- 
out a well identified second lepton by eliminating events 
with one lepton and certain second objects which form an 
invariant mass between 76 and 106 GeV/c 2 with the pri- 
mary lepton. For primary muons the other object is an 
opposite-signed isolated track with px > 10 GeV/c. For 
primary electrons the second object may be such a track, 
an electromagnetic cluster, or a jet with Et > 15 GeV 
and 1 77 1 < 2.0 that has fewer than three tracks and an 
electromagnetic energy fraction greater than 95%. The 
correction for the residual Z boson contribution to the 
W + jets sample is described in Section IVIII Small con- 
tributions from Z — > tt where a r is tagged are treated 
as a separate background, and described in Section IVT1 

The number of jets produced in association with the 
leptonically decaying W in the event is measured by se- 
lecting jets of cone radius Ai? = 0.4, with Et > 15 GeV 
and \t)\ < 2.0. The jets are clustered after removing tow- 
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FIG. 1: Transverse mass of the identified lepton and inferred 
neutrino, consistent with W boson production (162 pb _1 data 
sample) . 



events with W + 1 jet or W + 2 jets, where the ti contri- 
bution is negligible, serve as a check of the background 
prediction. 

In Reference [f| , the momentum spectrum of the lead- 
ing jets in the W + 1 and 2 jet events is shown to be 
a reasonable model of the backgrounds in the W + 3 or 
4 jet events, and is used in deriving a completely inde- 
pendent estimate of the b-tag backgrounds to top pro- 
duction in the W + 3 or 4 jet channels. The estimated 
background, 18 ±4 events, is in good agreement with our 
overall estimate of 23 ± 3 (for Ht > 0), derived from an 
explicit calculation for each contributing background pro- 
cess (see Sec I VI \ . The independence of these background 
estimates allows for a combined cross section calculation 
which will appear in a future paper. We note here that 
the technique of Reference 6] will work only in the study 
of top quarks in the W plus jet mode, and we consider it 
a cross-check on our more general technique for calculat- 
ing b-tag backgrounds, which will be employed in other 
b-tagging analyses at CDF. 

The final cross section calculation, a t i — (N b s — 
■^ r bk g )/(ett x £), depends on the product e t i of signal ac- 
ceptance and selection efficiency, the expected number of 
non-tt background events -/Vbkg, and the integrated lumi- 
nosity C. 



IV. SECONDARY VERTEX 6- TAGGING 



ers associated with the selected isolated electron from 
the leptonic W decay, and after correcting the tower Et 
for the location of the primary vertex z coordinate. The 
number of events in each jet multiplicity bin is shown in 
Table HJ The overall acceptance x efficiency of this selec- 
tion for ti events in the lepton+jets channel with three 
or more jets, including the leptonic branching ratios, is 
roughly 4% for the electron channel, 2% for muons in the 
CMUP, and 1% for muons in the CMX. 

The presence of the W boson in the selected 
events is verified by calculating the transverse mass 
of the lepton and the missing energy: Mt — 

\J (Et(€) + Et(v)) 2 — {P^{t) + Pr{v)) 2 . The distribu- 
tion of this variable for all events passing the requirement 
of a lepton, missing energy, and at least one jet is shown 
in Fig. n and displays the Jacobian edge associated with 
W production and decay. 

As a final optimization step, the selection will incor- 
porate an additional cut on the total transverse energy 
Ht of all objects in the event. Events from ti production 
have, on average, a significantly greater total transverse 
energy than background events. The optimization of this 
requirement and acceptance corrections and uncertain- 
ties will be discussed in Sec. I Villi 

Because the ti signal is expected to contribute signifi- 
cantly to the sample of events with jy + 3 jets or W + 
> 4 jets, an excess of observed events over the expected 
background with those jet multiplicities is assumed to be 
entirely due to ti production. The observed results for 



In this Section we describe and discuss the perfor- 
mance of an algorithm to identify jets resulting from 
heavy quark (b,c) fragmentation. This "SecVtx" algo- 
rithm is based upon the algorithm used to discover the 
top quark pj. Most of the non-ti processes found in the 
W + jets sample do not contain heavy quarks in the final 
state. Requiring that one or more of the jets in the event 
be tagged by SecVtx keeps more than half of the ti events 
while removing approximately 95% of the background. 



A. Description of the SecVtx Algorithm 

The SecVtx algorithm relies on the displacement of 
secondary vertices relative to the primary event vertex 
to identify b hadron decays. The Run II algorithm is 
essentially unchanged from Run I [4|, but the track se- 
lection cuts have been re-tuned for the CDF II detector. 

In order to select displaced tracks coming from decays 
of long-lived hadrons, precise knowledge of the collision 
point is necessary. To find an event-by-event primary 
vertex, we first identify which of the vertices described 
in Section [H] is nearest the identified high-momentum 
electron or muon. For other datasets without high- 
momentum leptons, we use the vertex which has the 
highest total scalar sum of transverse momentum of as- 
sociated tracks. The position of the primary vertex is 
then determined by fitting together the tracks within a 
±1 cm window in z around this vertex. The procedure 
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TABLE I: Number of events selected, before 6-tagging, for each jet multiplicity. 
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starts by fitting a vertex using all tracks within the z 
window and with impact parameter significance (relative 
to the beamline) \do/<Td \ < 3, where <7<j includes the un- 
certainty on both the track and the beamline positions. 
The transverse profile of the beamline at the z of the 
original vertex estimate is also used as a constraint in 
the fit. A pruning stage removes tracks which contribute 
X 2 > 10 to the fit (or the track with the largest x 2 con- 
tribution if the total fit reduced chi-squared per degree 
of freedom % 2 /ndf > 5). After the initial pruning, the 
fit is repeated using only the remaining tracks until a 
vertex with no tracks over the \ 2 cu t is found. If no 
tracks survive the pruning stage then the beamline pro- 
file is used for the primary vertex position estimate. In 
the event sample used for these results the uncertainty 
in the fitted transverse position ranges from 10 — 32 /im 
depending upon the number of reconstructed tracks and 
the topology of the event. 

Secondary vertex tagging operates on a per-jet ba- 
sis, where only tracks within the jet cone are consid- 
ered for each jet in the event. A set of cuts involving 
the transverse momentum, the number of silicon hits at- 
tached to the tracks, the quality of those hits, and the 
X 2 /ndf of the final track fit are applied to reject poorly 
reconstructed tracks. Only jets with at least two of these 
good tracks can produce a displaced vertex; a jet is de- 
fined as "taggable" if it has two good tracks. Displaced 
tracks in the jet are selected based on the significance 
of their impact parameter with respect to the primary 
vertex and are used as input to the SecVtx algorithm. 
SecVtx uses a two-pass approach to find secondary ver- 
tices. In the first pass, using tracks with pt > 0.5 GeV/c 
and \do/<Jd \ > 2.5, it attempts to reconstruct a sec- 
ondary vertex which includes at least three tracks (at 
least one of the tracks must have px > lGeV/c). If 
the first pass is unsuccessful, it performs a second pass 
which makes tighter track requirements (pr > lGeV/c 
and jdo/frfal > 3) and attempts to reconstruct a two- 
track vertex (one track must have pr > 1.5GeV/c). 

Once a secondary vertex is found in a jet, the two- 
dimensional decay length of the secondary vertex L 2 d 
is calculated as the projection onto the jet axis, in the 
r — <fi view only, of the vector pointing from the primary 
vertex to the secondary vertex. The sign of L 2 d is de- 
fined relative to the jet direction, specifically by the abso- 
lute difference \4>\ between the jet axis and the secondary 
vertex vector (positive for < 90°, negative for > 90°). 
Secondary vertices corresponding to the decay of b and c 
hadrons are expected to have large positive L 2 d while the 
secondary vertices from random mis-measured tracks are 



expected to be less displaced from the primary vertex. To 
reduce the background from the false secondary vertices 
(mistags), a good secondary vertex is required to have 
L 2 d/vl 2D > 3 (positive tag) or L 2D /a L2D < -3 (neg- 
ative tag), where ctl, 2D , the total estimated uncertainty 
on L 2 d including the error on the primary vertex, is es- 
timated vertex- by- vertex but is typically 190 /im. The 
negative tags are useful for calculating the false positive 
tag rate, as detailed in Section IVl Bl A tagged jet is de- 
fined to be a jet containing a good secondary vertex (the 
SecVtx algorithm will find at most one good vertex per 
jet). 



B. Measurement of Tagging Efficiency 

The results described in this paper require a knowledge 
of the tagging efficiency for tt events, i.e., how often at 
least one of the jets in a tt event is positively tagged by 
SecVtx. Because it is not possible to measure this di- 
rectly in tt events we have adopted a different strategy. 
A sample of jets whose heavy flavor fraction can be mea- 
sured is used to derive the per-jet tagging efficiency in the 
data for that sample. The heavy flavor in this sample is 
a mixture of charm and bottom, with the relative pro- 
portions of each determined from the mass spectrum of 
SecVtx tagged jets and the ratio of charm/bottom tag- 
ging efficiencies predicted by the Monte Carlo simula- 
tion. The charm component is suppressed by requiring 
a second tagged jet in the event, so that the measured 
tag efficiency is dominated by the contribution from bot- 
tom. Because the jets in tt events will in general have 
different energies, pseudorapidities, and track multiplic- 
ity than the jets in the calibration sample, the measured 
efficiency cannot be used directly. Instead, a matching 
sample of Monte Carlo jets is used to determine the tag- 
ging efficiency in the simulation for jets like those in the 
calibration sample, and the ratio of efficiencies between 
data and simulation (scale factor) is then used to cor- 
rect the tagging efficiency in tt Monte Carlo samples. 
In other words, the geometrical acceptance and energy 
dependence of the tagger are taken from the simulation, 
with the overall normalization determined from the data. 

To measure the efficiency for tagging heavy flavor 
hadrons, we use a sample of low-px inclusive electron 
data which is enriched in semilcptonic decays of bottom 
and charm hadrons. For the matching Monte Carlo sam- 
ple we use the herwig [2l| program to generate 2^2 
parton events, which are passed through a filter requir- 
ing an electron with pt > 7 GeV/c and |ry| < 1.3. Events 
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passing this filter are processed using the detector simu- 
lation described in Section UTI 

Electrons in the events are identified using the selection 
in Section ITTT1 except with lower thresholds Et > 9GeV 
and track pt > 8GeV/c. Further differences from Sec- 
tion nn are that the electrons are required to be non- 
isolated and conversions are not removed. The electron 
track must also pass through every layer of the SVX de- 
tector. 

Along with the electron we require two jets, the "elec- 
tron jet" and the "away jet." The electron jet is required 
to have Et > 15 GeV (including the energy of the elec- 
tron) and to be within 0.4 of the electron in r\ — <j) space 
(in other words the electron is within the jet cone), and 
is presumed to contain the decay products of a heavy fla- 
vor hadron. The away jet is required to have Et > 15 
GeV and \r)\ < 1.5, and it must be approximately back- 
to-back with the electron jet (A</> > 2rad). A total of 
481,301 events of the data sample pass these event selec- 
tion requirements. Figures and show that the Monte 
Carlo is an adequate representation of the data sample 
for relevant event selection and tagging variables. The 
differences can be attributed to the presence of fake elec- 
trons in the data which are not completely removed even 
after requiring a SecVtx tag, and which are not present 
in the Monte Carlo due to the generator-level electron 
filter. The discrepancy in Figure on the negative side 
of the pseudo-cr plot shows that the Monte Carlo under- 
estimates the mistag rate observed in the data. 

In order to measure the tagging efficiency for electron 
jets, it is first necessary to characterize their heavy flavor 
content. Two methods are used to measure the fraction 
Ft, of electron jets which contain a b hadron. The first 
method is to reconstruct D° — * K~ir + decays within the 
electron jet and use the invariant mass sidebands to sub- 
tract background; this method yields Ft, = 0.139± 0.021. 
The second method involves searching for secondary 
muons within the electron jet resulting from cascade c 
decays using the same-sign rate to estimate the back- 
ground; this method gives Ft, = 0.228 ± 0.037. Because 
the agreement is only at the 2a level, the uncertainty on 
the weighted average is inflated by 2.09 based on the \ 2 
of the two determinations. The combined result of the 
two measurements is F& = 0.161 ± 0.038. 

The fraction F c of electron jets which came from a 
charm quark also contributes to the total heavy flavor 
fraction Fhf = Fj, + F c . An estimate of the amount of 
c relative to b in the electron jet is derived from a fit to 
the invariant mass spectrum of the tracks in the positive 
tags found in the electron jets. Templates for b, c, and 
light-flavor jets taken from the Monte Carlo (and also 
from the data for light-flavor) were fitted to the distri- 
bution, as shown in Figure 0] to obtain the ratio of c to 
b after requiring a positive tag. The result of this fit is 
F c tag /F* ag = 0.118±0.017, where the uncertainty is dom- 
inated by the systematic error due to varying the light- 
flavor template. A value for F c /Fb before any tagging is 
obtained by multiplying this result by the ratio of tag- 
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FIG. 2: Data/Monte Carlo comparison of some quantities of 
tagged electron jets (L2D > 0, identified conversions have 
been removed for plotting purposes). Histograms are nor- 
malized to unit area. From top-left, clockwise: electron Et, 
electron-jet Et, away-jet Et, electron pt- (The last bin in- 
cludes all overflow entries.) 



ging efficiencies et,/e c — 5.2 ± 0.4 predicted by the Monte 
Carlo, resulting in F c /F}> = 0.61 ± 0.10. The uncertainty 
on £b/e c is derived from reweighting the Monte Carlo to 
match the multiplicity of tracks in the jet passing the 
quality cuts which is observed in the data. Applying the 
factor of 0.61, the total heavy flavor fraction of electron 
jets F HF is 0.259 ± 0.064. 

To measure the tagging efficiency of the heavy flavor 
electron jets we employ a double-tag technique, requiring 
that the away jet be tagged by SecVtx. This enhances 
the heavy flavor fraction of the electron jets and reduces 
the dependence on Fjjf , which we were only able to con- 
strain at the 25% level. Another benefit of the double-tag 
is to reduce the influence of the charm component, so that 
the resulting heavy flavor tag efficiency is more represen- 
tative of the 6-tagging efficiency. Tagging the away jet 
reduces the charm from 61% of the bottom component 
down to around 10%. The difference in the tag efficiency 
for semilcptonic decays, which we measure, and generic 
heavy hadron decays is used later to estimate a system- 
atic error. 

The tagging efficiency for heavy flavor jets containing 
an electron, derived from the numbers of double- and 
single-tags, is 
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FIG. 3: Data/Monte Carlo comparison of some quantities of 
tagged electron jets (identified conversions have been removed 
for plotting purposes). Histograms are normalized to unit 
area. From top- left, clockwise: number of good tracks in 
the jet, number of tracks in the tagged vertex, vertex mass 
of positively tagged electron-jets; pseudo-cr of (positively or 
negatively) tagged electron-jets. 
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FIG. 4: Fit (solid line) of the relative 6 and c contributions to 
the vertex tag mass distribution. Templates for the different 
flavors are derived from simulation (and the data in the case 
of light flavor). The error bars for the data are contained 
within the markers. 



where iV a+ and iV a _ are the numbers of positive and 
negative tagged away jets, and N^t, for example, is the 
number of events where both electron and away jet are 
positive tagged. 

The factor F^ F is the fraction of electron jets contain- 
ing heavy flavor for events where the away jet is tagged. 
This number is less than one due to events where the 
away jet is mistagged or contains heavy flavor due to 



gluon splitting or flavor excitation, and the electron is 
either a fake or part of a photon conversion pair. In or- 
der to estimate these effects we use identified conversions 
(see Section ITTTf) to probe the light flavor composition of 
the electron jets. In this way we write -Fg F as 



*Sp = 1 - 



A ! ;.' 



-at; 



N.4-—N. 



AV 
N 



(I-^hf), 



(2) 



where N is the number of events passing the selection, 

e' c = % c + _x c i an d the c subscript refers to events where 
the electron was identified as a conversion. A full deriva- 
tion of this expression can be found in the Appendix B 

To illustrate the effectiveness of the conversion finder, 
Figure shows the estimated radius of the conversion 
point for identified pairs. Peaks corresponding to known 
detector structures are clearly visible. 
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FIG. 5: Radius of identified conversions in data, with location 
of the silicon detector layers (LOO, SVX and ISL), readout 
system, and ISL and COT main mechanical structures. 



ofi^ F c 



We use the Fhf value derived above for data, a value 
0.861 for the Monte Carlo (found by counting 
jets which are matched to a heavy quark), and Equa- 
tions and (see Appendix) to calculate the efficiencies 
to tag a heavy flavor jet containing an electron in data 
and Monte Carlo. The resulting values averaged over jet 
Et are given in Tabic ITU The efficiencies as a function of 
the Ei of the jet are shown in FigureEJ The ratio of data 
to Monte Carlo efficiencies (scale factor) is also shown as 
a function of Et- Additionally, a sample of jet data with 
one jet having Et > 50 GeV and a corresponding 2^2 
Pythia Monte Carlo sample have been used to deter- 
mine that the ratio of jet tag rates is flat over a wider jet 
Et range than that spanned by the electron calibration 
sample. These samples are also used to estimate a sys- 
tematic uncertainty for extrapolating the scale factor to 
the higher-Er jets (typically 40-120 GeV) characteristic 
of top quark decays. 
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TABLE II: Efficiency to tag a heavy flavor electron jet in data 
and Monte Carlo, and the data/MC ratio (scale factor). Un- 
certainties on the efficiencies are statistical only; systematic 
uncertainties on the scale factor are summarized in Tabic ITTTI 



e(Data) 


0.240 ± 0.007 


e(MC) 


0.292 ±0.010 


Scale Factor 


0.82 ± 0.06 



u 
S 

a 

'3 
S3 
w 

s 

'3d 

05 

H 



0.8 
0.7 
0.6 
0.5 
0.4 
0.3 
0.2 
0.1 




i ■ ■ ■ ■ i ■ 1 1 ■ i ■ ■ ■ ■ i ■ ■ 1 ■ i 1 1 ■ ■ i ■ ■ ■ ■ i ■ 

-Data 



4 



15 20 25 30 35 40 45 
Electron Jet E T (GeV) 



50 55 



u 
o 
+^ 

u 

OS 
"ol 

u 




25 30 35 40 45 
Electron Jet E T (GeV) 



55 



FIG. 6: Efficiency to tag a heavy flavor electron jet as a func- 
tion of jet Et in data and Monte Carlo (top), and data/MC 
scale factor (bottom). The scale factor is consistent with be- 
ing constant over the Et range investigated. The solid line 
shows the result of the fit to the binned scale factor from 
which the \ 2 was derived. It is consistent with although not 
identical to the value of 0.82 obtained for the overall sample. 



Several sources of systematic uncertainty have been 
considered and are summarized in Table 11111 The F^ F 
method uncertainty accounts for assumptions made in 
the calculation of _Fg F about the tagging efficiency of 
heavy flavor electron jets containing a conversion elec- 
tron pair. The mistag subtraction uncertainty is related 
to the asymmetry in negative tags vs. fake positive tags 
described in the next subsection, and is conservatively 
estimated by scaling the negative tag rates for all jets 
by either zero (no subtraction) or by a factor of two. 
Et dependence was described earlier, and the -B-decay 
uncertainty allows for a possible difference in the scale 



factor due to the lower charged particle multiplicity of 
scmilcptonic B decays compared to all possible decay 
modes. Combining all systematic and statistical errors 
we obtain a data to Monte Carlo tagging efficiency scale 
factor of 0.82 ± 0.06. 



TABLE III: Relative uncertainties on the data to Monte Carlo 
tagging efficiency scale factor, in percent. 



Source 


uncertainty (%) 


F HF 


3.5 


Fh F method 


3.0 


mistag subtraction 


3.0 


Et dependence 


2.5 


B-decay 


1.2 


total systematic error 


6.2 


data statistics 


3.2 


MC statistics 


3.6 


total uncertainty 


7.8 



A variation of the double-tag technique has also been 
studied which uses the single-tag rate of electron jets 
rather than the measurements of -Fhf- First we write 
the efficiencies in the data as e = SF x £mc and 
e sin g ic = sp x £ singic^ wherg £ . g defined in Equation Q] 



and e singl ° = (N e 



N e -)/(F HF N) is the net single-tag 



efficiency for heavy-flavor electron jets. Although both e 
and e smgle are tag efficiencies for heavy-flavor jets, they 
generally differ because the requirement of an away jet 
tag suppresses the charm content of the sample relative 
to bottom. 

Substituting for e and Fhf (using the relation between 
F HF and SF x e^cf 6 ) into Equations lA~15l and [QD al- 
lows solution for the efficiency scale factor SF directly 
in terms of the data tag and conversion rates, and of the 
MC tag efficiencies cmc and e^c' - ^ result of SF = 0.81 
is obtained, consistent with the method described above 
and with similar systematic and statistical errors. 



C. Measurement of the Mistag Rate 

A "mistag" is defined to be a jet which did not re- 
sult from the fragmentation of a heavy quark, yet has a 
SecVtx secondary vertex. Mistags are caused mostly by 
random overlap of tracks which are displaced from the 
primary vertex due to tracking errors, although there are 
contributions from Ks and A decays and nuclear inter- 
actions with the detector material (the beampipe or the 
inner silicon layers) as well. Contributions from these ef- 
fects are measured directly from jet data samples without 
relying on the detector simulation. 

Because the SecVtx algorithm is symmetric in its treat- 
ment of do and L2D significance, the tracking-related 
mistags should occur at the same rate for L 2 d > and 
L 2 d < 0. Therefore, a good estimate of the positive 
mistag rate due to resolution effects can be obtained from 
the negative tag rate. However, some of the negative 
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tags occur in jets which do contain heavy flavor, so that 
part must be subtracted. In addition, the negative rate 
will not reflect the mistags due to lifetime or interactions 
with the detector material. Corrections for all of these 
effects arc determined using fits to the pseudo-cr spectra 
of tagged vertices, described in Section IV CI The sum of 
these corrections is found to be 20 ± 10% of the negative 
tag rate, consisting of a subtraction of 20% for removal 
of the heavy flavor negative tags, and an addition of 40% 
to account for the mistags due to lifetime and material 
interactions. 

The rate of negative tags for taggable jets is measured 
in an inclusive sample of jet triggers. The rate is param- 
eterized as a function of four jet variables - Et, track 
multiplicity, ry, and <fr - and one event variable "EE?, the 
scalar summed Et of all jets in the event with Et > 10 
GeV and \r]\ < 2.4. These parameterized rates are used 
to obtain the probability that a given jet will be nega- 
tively tagged. 

The full five-dimensional tag rate matrix was deter- 
mined using inclusive 20 GeV, 50 GeV, 70 GeV, and 100 
GeV jet trigger samples, for a total of 11.5 x 10 6 events. 
Figure[3shows the negative tag rate per taggable jet as a 
function of jet Et and track multiplicity (and integrated 
over the other variables) for all the events in the inclu- 
sive jet sample. These rates have not been scaled by the 
1.2 ±0.1 correction discussed above which is applied to 
convert to an estimate of the positive mistag rate. 

Detailed cross checks were performed on the tag rate 
matrix to verify its self-consistency and to check pre- 
dictability and sample dependence. Both the total tag 
rates and the tag rates as functions of various quanti- 
ties were used to check how well the matrix predicts 
the observed data and to estimate systematic errors. 
Table IIVI summarizes the differences between the ma- 
trix predictions and the observed tag rates in various 
validation samples. The four jet trigger samples de- 
scribed above were used, along with an independently- 
triggered sample requiring four jets with Et > 15 GeV 
and TjEt > 125 GeV, referred to as the "SumEt" sample. 
The table is divided into two sections. Each row in the 
table compares the tag rate predicted from one sample 
with the observed rate in a second, different sample. 

The differences in the tag rates of trigger jets and non- 
trigger jets are well predicted by the matrix. This is 
mostly due to the inclusion of the jet Et, T), and <fi into 
the matrix binning. The remaining residual difference is 
taken as a systematic error in the final result. 

The systematic uncertainties assigned to the tag rate 
matrix predictions are summarized in Table E] We as- 
sume that the various contributions are uncorrelated and 
add them in quadrature to find a total systematic uncer- 
tainty of 8% on the negative tag rates, which combined 
with the uncertainty on the correction factor 1.2 ± 0.1 
yields a total mistag rate relative uncertainty of 11%. 



S 0.016 

C£ 

M 0-014 
S3 

H 0.012 
0.01 : 
0.008 
0.006 '- 
0.004 ; 
0.002 : 

()E 



20 40 60 80 



100 120 140 160 
Jet Et (GeV) 



<3 0.07 F- 
as 

H 0.05 



0.04 
0.03 
0.02 
0.01 




8 10 12 14 

Number of Good Tracks 



FIG. 7: SecVtx negative tag rate as a function of jet Et and 
track multiplicity in the inclusive jet data. 



V. HEAVY FLAVOR CONTRIBUTIONS TO 
W+JETS 

Heavy flavor production in association with a vector 
boson (e.g. Wbb, Wcc, Wc) contributes significantly to 
the non-it background in the 6-tagged lepton + jets sam- 
ple, even though W + light flavor jet production dom- 
inates the pretag sample. Several Monte Carlo genera- 
tors are capable of performing matrix element calcula- 
tions for W/Z + jets, even to high jet multiplicity, but 
these generators use leading-order calculations. As a re- 
sult, the overall normalization of these calculations has 
a large theoretical uncertainty, even though the relative 
contributions of the important diagrams are well-defined. 

For this reason, the relative fraction of W + heavy 
flavor production is calculated in a matrix element Monte 
Carlo program, and the overall normalization of the W 
+ jets production is measured with collider data. The 
two results can be combined to estimate the W + heavy 
flavor background. 

For this analysis, we use a new event generator, ALP- 
GEN [2{| , which calculates exact matrix elements at lead- 
ing order for a large set of parton level processes in QCD 
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TABLE IV: Differences in predicted and observed negative tagging rates for various samples. The first four rows with labels of 
the form Samplel-Sample2 compare observed tag rates in Sample 2 to the rates predicted by a matrix made from Sample 1. The 
last three rows compare the observed tag rates for trigger jets, non-trigger jets, and jets in the SumEt sample with predictions 
from the standard mistag matrix derived from all four jet samples. 





Observed Negative Tag Rate (%) 


Predicted Negative Tag Rate (%) 


Obs./Pred. 


Jet20-Jet50 


0.728± 0.008 


0.677±0.046 


1.08± 0.08 


Jet50-Jet70 


0.958± 0.009 


0.930±0.013 


1.03± 0.02 


Jet50-Jetl00 


1.219± 0.009 


1.151±0.044 


1.06± 0.04 


Jet50-SumEt 


0.730± 0.005 


0.712±0.015 


1.03± 0.02 


Trigger Jet 


0.565± 0.005 


0.587±0.005 


0.96± 0.01 


Non- Trigger Jet 


0.659± 0.005 


0.640±0.006 


1.03± 0.01 


SumEt 


0.712± 0.006 


0.726±0.007 


0.98± 0.01 



TABLE V: Systematic uncertainties assigned to the negative 
tag rate matrix predictions. 



Source 


Uncertainty 


Trigger jet bias 


4% 


Sample bias 


1% 


Statistics 


1% 


Total 


8% 



and electroweak interactions. All heavy quark masses, 
spins and color flows are treated properly inside ALP- 
GEN. Heavy flavor fractions calculated using ALPGEN are 
calibrated against fractions measured from jet data. 

The total W + heavy flavor contribution is estimated 
by multiplying the number of pretag W + jets events 
in data, given in Table |I] by the calculated W + heavy 
flavor fraction and the tagging efficiency in Monte Carlo 
(including the SecVtx efficiency scale factor between data 
and Monte Carlo). Because the event tagging efficiency 
depends on the number of heavy flavor jets in the fiducial 
region \t)\ < 2.4, we calculate results separately for the 
case of 1 and 2 heavy flavor jets. 



A. Heavy Flavor Monte Carlo Samples 

Parton-level events from the ALPGEN matrix element 
calculation are fed to the herwig parton shower pro- 
gram which generates additional jets from gluon radia- 
tion. The matrix element gives a good description of the 
production of a few, widely separated partons, whereas 
parton showers are better suited to model the emission of 
soft collinear gluons. Following a matrix element calcula- 
tion with a parton showering algorithm provides a better 
model of the data than does either approach separately. 

One outstanding issue for such a combined approach 
is how to avoid double counting in the region of phase 
space populated both by higher order matrix elements 
and the parton shower. Specifically, the radiation from 
the parton shower in a W + n parton Monte Carlo sam- 
ple can produce jets which cover part of the phase space 
described by the W + (n + 1) parton Monte Carlo. Al- 



though a rigorous combination prescription has been pro- 
posed to avoid such double counting, it has not yet been 
fully implemented in any of the matrix element Monte 
Carlo programs [2^, |3(j ■ 

A simple procedure deals with the possible double 
counting by matching final state partons to reconstructed 
jets and rejecting events where the showering algorithm 
has produced a hard parton 0,H3- Events are rejected 
if there are extra jets which fail to match to the light 
partons generated at the matrix element level or if there 
are missing jets. In the special case of heavy flavor par- 
tons, the strict matching criteria are relaxed because two 
partons may be merged into one jet due to the parton 
mass. Although it minimizes double counting of gen- 
erated events, this procedure introduces a new type of 
systematic uncertainty which depends on the matching 
criteria and the jet definition. 

The matching algorithm is applied at the stable gener- 
ated particle level, before any detector simulation. Sta- 
ble particles after the parton shower are required to have 
Pt > 0.4(0.0) GeV/c for charged (neutral) particles and 
|ry| < 3. The jet clustering is a simple cone clustering 
scheme where the number of final jets (particles) is re- 
duced by joining the two closest jets (particles) within 
a cone of radius AR = 0.4 into one. Once all possible 
merging is completed, the jet four-momentum is recal- 
culated using all of the particles inside the jet cone. A 
stable-particle jet is required to have Et > 10 GeV and 
|ry| < 2.4, and the matched parton must fall within a cone 
radius of 0.4. 

The following requirements reduce event double count- 
ing after the parton shower: 1) reject events in which an 
extra jet failed to match any parton from the matrix el- 
ement calculation, 2) ignore matching requirement for 
heavy flavor partons because the effect of their masses 
has been included in the matrix element calculation, and 
3) keep only the events which pass the strict jet-light 
parton matching. 

Fully exclusive matched events in each matrix element 
Monte Carlo sample are summed, weighting by the ap- 
propriate cross sections. Because the double-counted 
events have been removed by the matching procedure, 
this combined sample should reproduce the W + jets 
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FIG. 8: Observed JU + jets cross section compared with the 
ALPGEN W + jets prediction as a function of number of jets. 
(Only statistical errors are shown, and the results are not 
corrected for acceptance.) 



TABLE VI: Summary of systematic uncertainties in the heavy 
flavor fraction determination. 



Source 


Uncertainty 


Fractions 


Wbb 


Wcc Wc 


Matching criteria 


15% 


15% 10% 


Q 2 scale (2M^ to 0.5M^) 


4% 


4% 5% 


PDF 


5% 


5% 10% 


Jet energy scale 


5% 


5% 10% 


ISR/FSR 


10% 


10% 10% 


6, c masses (4.75, 1.55 ± 0.3 GeV/c 2 ) 


6% 


10% 


Total 


21% 


22% 21% 



and 0.5m^), parton distribution functions (among the 
20 eigenvector pairs from CTEQ6M[3j^), and the heavy 
quark mass (±0.3 GeV). The relative systematic uncer- 
tainties in Tablc lvTl arc applied to all jet multiplicity bins. 
The final measured heavy flavor fractions for W + jets 
events can be found in Table IVTTll 



data. These results are stable in terms of different match- 
ing algorithms, cone size, and jet Et requirement. The 
predicted W + jets cross section, without any acceptance 
correction, is plotted in Figure|Hlwith the measurement in 
the electron and muon channels. The non-VF and diboson 
backgrounds as well as the expected contribution from tt 
production are subtracted for this measurement. Even 
though the overall normalization of the Monte Carlo does 
not reproduce the data very well, the jet multiplicity de- 
pendences in data and Monte Carlo are in good agree- 
ment. 



B. Heavy Flavor Fraction in Simulated W + jets 
Events 

The heavy flavor fractions for W + jets events, com- 
puted using an ALPGEn/herwig Monte Carlo sample, 
are defined to be the ratio of the observed W 4- heavy 
flavor and W + jets cross sections. 

The matching algorithm operates with particle-level 
jets, but jets from a full calorimeter simulation provide 
better agreement with jets in data. A detector-level jet is 
required to have Et > 15 GeV and \r)\ < 2, and a heavy 
flavor jet is required to match to any b or c parton inside 
a cone with AR — 0.4. 

A summary of systematic uncertainties inherent in this 
heavy flavor fraction measurement is presented in Ta- 
ble lVIl The matching uncertainty is estimated by recom- 
puting the heavy flavor fraction after varying the match- 
ing cone sizes (0.4, 0.7) and E T from 10 to 15 GeV. We 
take half of the difference in the 4-jet bin as the matching 
systematic uncertainty. Uncertainties due to the interac- 
tion energy scale Q 2 , PDFs, and heavy quark masses are 
calculated by comparing the ratio of the Wbb + 1 parton 
and W + 3 partons cross sections from ALPGEN and esti- 
mating the variation by changing the Q 2 (between 2m^ 



C. Calibration of Heavy Flavor Fraction Using Jet 

Data 

With the current data sample and a limited number of 
SecVtx-tagged W + jets data events, it is difficult to ver- 
ify the ALPGEN heavy flavor fractions in W + jets events 
directly with data. Fortunately, an inclusive jet sam- 
ple, without identified W bosons, is a large related class 
of events whose production processes are described by 
Feynman diagrams similar to those of W + jets events. 
In particular, gluon splitting to heavy quark pairs ac- 
counts for part of the heavy flavor production in both 
samples. The inclusive QCD jet sample can be used to 
compare the heavy flavor fractions calculated in Monte 
Carlo with results from data. Any discrepancy between 
heavy flavor fractions in data and Monte Carlo could then 
be used to adjust the calculated heavy flavor fractions in 
W + jets events. 

Heavy flavor fractions are calculated in both pythia 
and ALPGEN+herwig Monte Carlo jet samples. Events 
are required to have 2 or 3 jets with Et > 15 GeV and 
|ry| < 2.0 and at least one jet with E T > 20 GeV to 
satisfy trigger requirements. Events from the ALPGEN 
sample must also pass the matching algorithm described 
in Section IVAl 

Contributions to the jet data sample from heavy and 
light partons are determined by fitting the pseudo-cr dis- 
tribution for tagged jets, thereby discriminating between 
jets from 6, c, and light partons or gluons on a statis- 
tical basis. Pseudo-cr is defined as L 2 d x M vtx /p T tx , 
where M ¥tx is the invariant mass of all tracks in the sec- 
ondary vertex and p T tyi is the transverse momentum of 
the secondary vertex four-vector. Even though the L 2 u 
distribution is similar for b and c quarks, the pseudo-cr 
is very different for the two flavors. 

The fit is made more robust by subtracting the contri- 
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FIG. 9: Pseudo-cr distribution for jet data, including fitted 
contributions for the different components of heavy flavor and 
secondary interactions in light flavor jets. 



bution from negative SecVtx tags and fitting the differ- 
ence only, as shown in Fig. [§] Template distributions of 
the pseudo-cr for b and c jets are derived by matching 
jets to partons in Monte Carlo, and a separate template 
is created for secondary interactions in light quark jets, 
including material interactions and long-lived A and 
particles. 

If the signed decay length distribution of secondary 
vertices in light flavor jets and from tracking combina- 
torics were symmetric about zero, then the number of 
fake positive tags from light flavor could be simply es- 
timated by counting the number of negative tags. Un- 
fortunately, secondary vertices from material interactions 
or long-lived light flavor particles are more likely to have 
positive decay lengths than negative decay lengths, and 
there are some real heavy flavor jets with negative de- 
cay lengths. The heavy flavor contribution with negative 
decay lengths is first estimated from Monte Carlo, and 
then scaled by a factor of 1.6 ±0.3 to account for a larger 
overall observed negative tag contribution in data than 
in Monte Carlo events. 

The net excess of secondary interactions on the positive 
side, AN, is computed from the secondary contribution 
fit results, after subtracting the heavy flavor contribu- 
tions on the negative side. The resulting average correc- 
tion factor AN/N needed to scale the number of negative 
tags to obtain the correct number of fake positive tags 
is 1.2 ± 0.1. This average factor is applied uniformly to 
all jets, independently of jet Et and other jet properties. 
The uncertainty on this factor is due to the uncertainties 
in the fit templates and the difference in AN/N between 
the different jet Et bins. 

The heavy flavor fraction as a function of jet Et is 
stable, as shown in Table IVIII where an uncertainty of 
5% (10%) for the b (c) fraction is included due to tem- 
plate uncertainties. These results include the effect of 
the efficiency scale factor between data and simulation. 
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FIG. 10: Distribution of closest jets in Acf> for tagged 3-jet 
data events with fake tag prediction (top) and for double- 
tagged events in which the tagged jets are also the two closest 
jets (bottom). 



Measured heavy flavor fractions from the data are consis- 
tently 50% higher than the ALPGEN prediction, for both 
b and c jets, although the pythia calculation seems to 
match the data more closely. These heavy flavor fractions 
are compared with the heavy flavor fractions calculated 
using ALPGEN inclusive jet Monte Carlo with the match- 
ing prescription. On average the data/ALPGEN ratio is 
1.5 ± 0.4, where the uncertainty is dominated by the sys- 
tematic uncertainties associated with the ALPGEN heavy 
flavor calculations ( Table IVT|) . From these fits alone, it is 
not clear if the discrepancy is consistent for all produc- 
tion diagrams or only for some subsets of gluon splitting 
to heavy flavor partons. 

Because jets with gluon splitting have a small opening 
angle, the distribution of Atfi between the two closest jets 
in an event highlights the contribution from gluon split- 
ting. A sample of events with 2 tagged jets is selected 
from the 3-jet sample and compared to Monte Carlo. The 
mistag contribution is removed from the double-tagged 
samples by subtracting events with one or more negative 
tag. The good agreement, shown in Fig. [23 indicates 
that the gluon splitting contribution relative to other pro- 
duction mechanisms is well-modelled. 

Another sample with gluon splitting contributions, this 
time of single-tagged 3-jet events, can be used to check 
the dependence of the data/ALPGEN normalization fac- 



17 



TABLE VII: Fitted contributions from 6, c jets and secondary interactions or long-lived light flavor particles in data events. 
The uncertainties on the b and c fractions are total uncertainties including 5% and 10% uncertainties due to the templates. 
The ratio AN/N estimates the excess of positive over negative tags in data events, due to secondary interactions and long-lived 
light flavor particles. 



E T (GeV) 


E T < 25 


25 < E T < 35 


35 < E T < 45 


E T > 45 


All 


_1_ CI t__ CL I J 


858 643 


415 373 

1 Y ' ) * 'J 1 'J 


128 994 


77,632 


1 480 642 


Pos. - Neg. 


12,208 


7131 


2511 


1596 


23,446 


Negative 


3283 


1999 


803 


697 


6782 


Fitted bs 


7937 ± 483 


4412 ±312 


1609 ± 131 


843 ±102 


15, 147 ± 507 


Fitted cs 


3040 ± 427 


1858 ± 276 


520 ± 110 


407 ± 93 


5589 ± 451 


Secondary 


1284 ± 142 


900 ± 102 


379 ± 50 


324 ± 39 


2836 ± 171 


A7V 


482 ± 224 


431 ± 144 


230 ± 59 


227 ± 44 


1336 ± 365 


AN/N(%) 


15 ±7 


22 ±7 


29 ±7 


32 ±7 


20 ±5 


6s/Jets (%) 


0.92 ± 0.08 


1.06 ±0.10 


1.25 ±0.12 


1.09 ±0.14 


1.02 ±0.06 


cs/Jets (%) 


0.35 ± 0.06 


0.45 ± 0.08 


0.40 ±0.10 


0.52 ±0.13 


0.38 ± 0.05 
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FIG. 11: The positive tag excess rate in data and ALPGEN 
Monte Carlo as a function of A<j>. 



tor. When the excess tag rate, interpreted as the heavy 
flavor fraction, is plotted as a function of minimum A(f> 
between jets (Fig. 111(1 . there is no evidence of structure in 
the fractions as a function of A</> even though the heavy 
flavor fractions in data are still 1.5 times the heavy flavor 
fractions in Monte Carlo. This consistency disfavors the 
hypothesis of missing or under-represented heavy flavor 
production diagrams. 

The measured ratio of 1.5 ± 0.4 between the heavy 
flavor fractions in the ALPGEn/herwig samples and the 
data is not inconsistent with other recent studies, which 
indicate that a A"-factor may be necessary to account 
for higher-order effects j^. Based on this calibration 
with the jet data sample, we scale the expected Wbb 
and Wcc background contributions derived from ALPGEN 
by a factor of 1.5 ± 0.4. Since the Wc background is 
produced through a different diagram, that contribution 
is not rescaled. 

Table IVIlTl summarizes the one and two b (c) fractions 
as a function of jet multiplicity, as well as the correspond- 



ing SecVtx tagging efficiencies, where the efficiency scale 
factor, as calculated in Section llVl has already been ap- 
plied. The IB (1C) fractions are for events with exactly 
one jet matched to a b (c) parton, and the 2B (2C) frac- 
tions are for events with exactly two jets matched to b (c) 
partons. These values are used in Section TVI CI to pre- 
dict the background contribution from W ± heavy flavor 
production 



VI. BACKGROUNDS IN THE TAGGED W + 
JETS SAMPLE 

The non-ii events in the W ± jets sample are from 
direct QCD production of heavy flavor without an as- 
sociated W boson, mistags of light quark jets in W + 
jets events, W + heavy flavor production, and other low 
rate electroweak processes with heavy flavor such as di- 
boson and single top production. The estimation of each 
of these backgrounds is described in turn. 



A. Non-W QCD Background 

The non-iy QCD background is a mixture of events 
where the lepton does not come from the decay of a W or 
Z boson. These include lepton and missing energy fakes 
as well as semileptonic b hadron decays. Since several 
backgrounds are calculated by normalizing to the num- 
ber of W + jets events before tagging, it is necessary to 
understand the level of QCD contamination in the pretag 
sample. In addition, some of these non-W-^ QCD events 
may be 6-tagged. Both the pretag and tagged contribu- 
tions are measured directly from data events. 

In a leptonic W decay, the lepton is isolated and there 
is large $ T due to the neutrino, while in non-W events 
this is not necessarily true. We define the lepton isola- 
tion, 2soi, as the ratio of energy (not due to the lepton) 
in the calorimeter in a cone around the lepton direction 
to the measured electron (muon) energy (momentum). 
Isolated leptons will have small values of Zsd . Sideband 
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TABLE VIII: Ratio of W ± heavy flavor production to total W ± jet production, for different jet multiplicities. The heavy 
flavor ratios include the correction factor 1.5 ± 0.4 as measured from jet data, and the SecVtx event tagging efficiencies include 
the scale factor described in Section IIVI These values are used in Section lVl Cl to predict the background contribution from W 
± heavy flavor production. 



Jet multiplicity 


1 jet 


2 jets 


3 jets 






>4 jets 


H T (GeV) 






H T > 


Ht > 200 


H T > 


H T > 200 


W + HF fractions (%) 


IB 


1.0 ±0.3 


1.4 ±0.4 


2.0 ±0.5 


2.4 ±0.6 


2.2 ±0.6 


2.2 ± 0.6 


2B 




1.4 ±0.4 


2.0 ±0.5 


2.3 ±0.6 


2.6 ±0.7 


2.6 ±0.7 


1C 


1.6 ±0.4 


2.4 ±0.6 


3.4 ±0.9 


3.8 ± 1.0 


3.6 ± 1.0 


3.5 ± 1.0 


2C 




1.8 ±0.5 


2.7 ±0.7 


2.9 ±0.8 


3.7 ± 1.0 


3.7 ± 1.0 


Wc 


4.3 ±0.9 


6.0 ±1.3 


6.3 ±1.3 


6.0 ± 1.3 


6.1 ±1.3 


5.9 ±1.3 






SecVtx ta{ 


;ging efficiencies (%) 








1B(> ltag) 


26.8 ± 2.0 


27.8 ±2.2 


29.3 ± 2.5 


30.9 ±2.9 


24.2 ±3.3 


27.4 ± 3.8 


2B(> ltag) 




48.6 ±3.2 


50.0 ±3.8 


52.6 ± 4.5 


50.3 ±4.9 


50.0 ±5.1 


2B(> 2tags) 




9.1 ± 1.4 


9.5 ± 1.5 


10.4 ± 1.6 


8.1 ± 1.4 


8.6 ± 1.5 


1C(> ltag) 


6.2 ±0.9 


6.7 ± 1.0 


6.1 ± 1.1 


6.6 ± 1.3 


7.7 ± 1.9 


7.5 ±2.0 


2C(> ltag) 




12.3 ± 1.9 


11.6 ± 2.0 


12.6 ± 2.5 


10.1 ± 2.3 


9.6 ±2.4 


2C(> 2tags) 




0.5 ±0.2 


0.4 ±0.1 


0.5 ±0.2 


0.8 ±0.4 


0.9 ±0.4 


Wc (> ltag) 


5.8 ±0.9 


6.1 ±0.9 


7.1 ± 1.2 


7.6 ± 1.5 


5.6 ± 1.6 


5.8 ± 1.8 



regions for lepton isolation and $ T in the high-pr lep- 
ton sample contain mostly non-II^ events and are used to 
extrapolate QCD expectations in the signal region. The 
sideband regions are defined as follows: 

1. Region A: I sol > 0.2 and $ T < 15 GeV 

2. Region B: X so \ < 0.1 and $ T < 15 GeV 

3. Region C: l sol > 0.2 and $ T > 20 GeV 

4. Region D (W signal region): l so \ < 0.1 and $ T > 
20 GeV 

For the QCD background these two variables are as- 
sumed to be mostly uncorrelated: the ratio of won-W 
events at low and high X ao \ values in the low $ T region 
is the same as in the high $J T region. The number of 
non-PF events in the signal region is estimated by 



QCD 



D 



N B x N c 
N A 



(3) 



The contribution of true W and tt events in the side- 
band regions is estimated using Monte Carlo samples to 
determine the ratio of W and tt in the signal and side- 
band regions, and normalized to the observed number 
of events in the pretag signal region. The correction is 
5-30% depending on the lepton type and event jet mul- 
tiplicity. 



QCD background fraction in the signal region. The main 
source of systematic uncertainty is the underlying as- 
sumption that the lepton isolation and ]ft T are uncor- 
related for this background. A study of non-isolated 
lcptons indicates that this assumption adds a 25% sys- 
tematic uncertainty to the non-VF QCD background es- 
timate. 



2. Tagged Backgrounds 

Some of the uon-W QCD events are 6-tagged and end 
up in the final event count. One estimate of this con- 
tribution applies Equation[3]to the tagged event sample, 
but this method is limited by the tagged sample size. To 
increase the number of events, regions A and C are re- 
defined by lowering the isolation boundary to the edge 
of the signal region, X so \ > 0.1. The precision on this 
estimate is limited by the number of tagged events in the 
sideband regions. 

A second method scales the pretag QCD fraction by 
the average tagging rate for QCD events. This method 
has the advantage of normalizing the background with 
the larger statistics of the pretag sample, but requires 
a reliable estimate of the tag rate. The tagging rate in 
region B for events with two or more jets is applied to 
the number of taggable jets in the signal region times the 
pretag QCD background fraction. 

Both background estimates contribute to the weighted 
average shown in Table ITXl 



1. Pretag Backgrounds 

The non-VF QCD background is calculated separately 
for the electron and muon channels, as well as for dif- 
ferent jet multiplicities. Table IIXI gives the predicted 



B. Mistags 

Mistag background events are W ± jets events where 
the tagged jet does not result from the decay of a heavy 
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TABLE IX: Non-W QCD background estimate. Results from the tag rate method and the tag sample method are the number 
of events expected in the b-tagged lepton + jets sample. 











Ht 


> 


H T > 


200 GeV 




Jet multiplicity 


1 jet 


2 jets 


3 


jets 


> 4 jets 


3 jets 




> 4 jets 


Electrons 


Pretag non-W QCD Fraction 


0.14 ± 0.04 


0.17 ± 0.04 






0.20 ± 0.05 








Tag Rate Method 


16.3 ± 4.7 


7.4 ± 2.2 




3.2 


£ 1.0 


2.1 


± 


0.6 




Tag Sample Method 


21.8± 3.8 


10. 0± 2.2 




4.9 


± 1.3 


2.6 


± 


0.8 




Combined Tag Estimate 


19.6 ± 3.0 


8.7 ± 1.6 


2.7 


± 0.6 


1.1 ± 0.2 


1.3 ± 0.3 




1.0 ± 


0.3 


Muons 


Pretag non-W 7 QCD Fraction 


0.034 ± 0.010 


0.043 ± 0.011 






0.075 ± 0.023 








Tag Rate Method 


4.0 ± 1.3 


1.2 ± 0.6 




0.7 


± 0.3 


0.5 


± 


0.1 




Tag Sample Method 


4.8± 1.1 


1.8± 0.5 




1.3 


£ 0.4 


1.0 


± 


0.3 




Combined Tag Estimate 


4.5 ± 0.8 


1.5 ± 0.4 


0.7 


± 0.2 


0.2 ± 0.1 


0.3 ± 0.1 




0.3 ± 


0.1 


Electron+Muon 


24.3± 3.5 


10.5± 1.9 


3.4± 0.7 


1.3± 0.3 


1.6± 0.4 




1.2± 


0.4 



quark. As described in Sec. UVCl the mistag rate per jet 
is parameterized as a function of the number of tracks, 
the raw jet Et, the 77 and 4> of the jet, and the sum of 
the Et for all jets in the event with Et > 10 GeV and 
1 77 1 < 2.4. To estimate the size of the mistag background, 
each jet is weighted with its mistag rate in the pretag 
sample. The sum of the weights over all jets in the sam- 
ple is then scaled down by the fraction of pretag events 
which are due to QCD background, as in Sec. IVI A II 
since these have already been counted in the procedure of 
Sec. IVI A~2l The low mistag rate per jet means that a neg- 
ligible number of events have more than one mistagged 
jet; therefore, the number of mistagged jets is a good ap- 
proximation of the number of events with at least one 
mistagged jet. This method is tested by comparing the 
negative SecVtx tags observed and predicted for the pre- 
tag sample as a function of the jet Et, plotted in Fig- 
ure^! There is reasonable agreement in the shape and 
normalization of the prediction. 

For the estimate of the number of fake positive tags, 
the mistag correction factor of 1.2 ± 0.1 described in 
Sec. IV CI is applied to account for additional mistags of 
light quark jets due to material interactions or long lived 
light quark hadrons. The final results for the mistag es- 
timate are shown in Table IXTTTl The error includes sta- 
tistical uncertainties from the pretag sample, including 
the small effect of correlation between mistag weights 
that come from the same bin. In addition, an 11% total 
systematic uncertainty includes uncertainty due to the 
sample dependence of the mistag rate parameterization 
and the uncertainty on the mistag correction factor of 1.2 
for the positive/negative mistag asymmetry. 



C. W + Heavy Flavor Backgrounds 

The production of W bosons associated with heavy 
flavor in the processes Wbb, Wcc, and Wc is a signifi- 
cant part of the background for the tagged sample. The 
techniques described in Sec. El are used to estimate the 
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FIG. 12: Comparison of observed and predicted negative 
SecVtx tags vs. jet Et in the lepton + jets sample. 



fraction of the inclusive W + jets events which have Wbb, 
Wcc, and Wc. The number of Wbb, Wcc, and Wc events 
is given by multiplying the heavy flavor fractions by the 
pretag event count, after subtracting the non-W back- 
grounds. Estimates of the tagged background are then 
obtained by multiplying the tagging efficiencies summa- 
rized in Table IVTlTl 

The pretag W + jets sample includes some contribu- 
tion from misidcntificd Z — > /i + /i~ events. The heavy 
flavor fraction for that process is twice as large as for the 
W events. The extra contribution of heavy flavor from Z 
events is described in Section fVIII and given in Table Ixl 
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Corrections due to tt contributions in the pretag events 
are discussed in Section ITXI 



D. Other Backgrounds 

A number of backgrounds are too small to be mea- 
sured directly, thus we use the Monte Carlo to predict 
their contribution to the sample. The diboson produc- 
tion processes WW, WZ, and ZZ, in association with 
jets, can mimic the tt signal when one boson decays lep- 
tonically and the other decays to a taggable b or c quark 
jet. The process Z — ► t + t~, in association with jets, 
can mimic the signal when one r decays leptonically and 
the other hadronically. Top quarks are expected to be 
produced singly with a tb final state through s-channel 
qq annihilation, and i-channel VF-gluon fusion processes. 

We use Monte Carlo samples to measure the accep- 
tance and tagging efficiency. The Monte Carlo accep- 
tance is corrected for the lepton identification and trig- 
ger efficiencies as is done for the tt acceptance as de- 
scribed in Section IVIIII The tagging efficiency is scaled 
by the MC/data tagging scale factor, with double the 
uncertainty for tagging charm jets as in W — > cs. 
The normalization is based on the measured integrated 
luminosity and the following theoretical cross sections 
cr(singletop) = 2.86±0.09pb, a(WW) = 13.25 ±0.25pb, 
a(WZ) = 3.96±0.06pb, and a(ZZ) = 1.58±0.02pb [H 



E. Background Summary 

A complete summary of all of the background con- 
tributions is given in Table IXTTTl Figure 031 shows the 
contribution of the different backgrounds for each jet 
bin compared to the number of data events satisfying 
all of the selection criteria and having at least one posi- 
tively tagged jet. We find good agreement between back- 
ground and data in the one and two jet bins, validating 
our background calculation. The excess of tags in the 
three and four jet bins is attributed to tt. We have al- 
ready described how the estimates for Wbb, Wcc, Wc 
and mistags, which depend on the number of W pretag 
events in the data, are corrected for the contribution of 
QCD backgrounds to the pretag sample. A similar cor- 
rection needs to be made to account for the real tt in the 
pretag sample. This is done as part of the cross section 
measurement as described in Section IIXI We find the 
pretag sample to be 10-15% tt in the three jet bin and 
40-50% tt in the four jet bin. 



VII. CROSS CHECK USING THE Z + JETS 
SAMPLES 

An investigation into the Z + jets sample provides a 
jood cross check on our background calculations since 
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FIG. 13: Number of events passing the selection criteria with 
at least one tagged jet, and the background prediction for the 
same selection. The Ht requirement has not been applied. 



the tt and non-TF QCD contamination in these samples 
is small. 

The heavy flavor contribution in Z + jets is expected 
to be close to that in W + jets in terms of gluon splitting. 
However, there is an additional diagram gg — ► Zbb, which 
is not present in W + jets. We use the same procedures 
described in previous sections to estimate the heavy fla- 
vor fractions using the Z + jets alpgen Monte Carlo 
samples. The fraction of Zee events (including Zc) is 
approximately twice the fraction of Wcc events, and the 
fraction of Zbb events is approximately twice the frac- 
tion of Wbb events. The heavy flavor fractions in Z + 
jets are therefore estimated by multiplying the above fac- 
tors with the heavy flavor fractions in W + jets listed in 
Table EIII1 

Events with a Z boson are selected by identifying op- 
positely charged e + e~ and pairs with an invariant 
mass between 75 and 105 GeV/c 2 . Both leptons are re- 
quired to pass the tight lepton selection used for the W 
+ jets analysis in order to collect a pure sample of Z 
candidates. 

Tablc[X]lists the yield of Z candidates and the number 
of tagged events observed as a function of jet multiplic- 
ity. The background predictions are also given and are 
calculated in the same way as in the previous sections for 
the W + jets sample: 14.0 ± 1.9 events are predicted and 
18 are observed in the Z + jets sample fFig. I14|) . 

Some Z + jets events which fail the standard Z re- 
moval contribute to the W + jets sample. The fraction of 
Z — ► n + fj,~ events left in the W sample is about 72 ± 8% 
of the number of events observed in Z — ► decay. 
The contribution of Z — ► e + e~, on the other hand, is neg- 
ligible. Since those Z events left in the W sample have 
a higher heavy flavor fraction than the W events, a cor- 
rection factor accounts for the additional tagged events 
expected in the W + jets sample. 
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TABLE X: The predicted number of Z + jets events and the 
observed number, along with the Z + jets contribution left in 
the W + jets sample and the estimate of the resulting extra b 
tags in that sample. (The prediction of extra 6-tagged events 
is included in the predicted background summary for the W 
+ jets sample.) 



Jet multiplicity 


Z+l jet 


Z+2 jets 


Z+>3 jets 


Z -*■ e+e" 


410 


48 


10 


Z — > fjT 


402 


59 


15 


z -> e+r 


812 


107 


25 


Mistags 


2.4 ±0.2 


0.49 ±0.06 


0.23 ±0.04 


Zbb 


1.6 ±0.4 


0.8 ±0.2 


0.26 ±0.08 


Zee 


4.4 ± 1.3 


2.3 ±0.7 


0.8 ±0.2 


top ( fftr = 5.6 ± 1.4) 


0.08 ±0.02 


0.5 ±0.1 


0.13 ±0.03 


Pred. Total 


8.5 ±1.7 


4.1 ±0.9 


1.4 ±0.3 


Observed Events 


12 


3 


3 


Pretag VK+jets 


289 ± 35 


42 ±7 


11 ±3 


Tagged in W+jets 


1.1 ±0.3 


0.6 ±0.2 


0.2 ±0.1 
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FIG. 14: Comparison of the observed and predicted number 
of events in the 6-tagged Z + jets sample. 



VIII. EVENT SELECTION OPTIMIZATION 
AND ACCEPTANCE 



The event selection described in Section UTTl combined 
with the requirement that at least one jet be positively b- 
tagged, yields a clean sample of top decays in the lepton 
plus jets channel; the expected signal over background 
ratio is of the order of 2:1. Several ways of optimizing 
the event selection were studied in order to maximize 
the significance of the cross section measurement, and an 
inclusive event variable was found to have the greatest 
power to discriminate tt signal from background events. 
The following section discusses event selection optimiza- 
tion using the Ht variable. 



A. Optimization with the Ht variable. 

The event quantity Ht is denned as the scalar sum 
of the transverse energy of all the kinematic objects in 
the event (transverse momentum for muons), including 
all jets with E T > 8 GeV and \r}\ < 2.5: 



all jets 



tt/T H~ + E> 



'electron 



or p^ uon 



Because of the large mass of the top quark, Ht, which 
is representative of the hard scatter of the event, tends 
to be significantly larger for tt events than for the back- 
grounds. FigurelTSIshows the distribution of the Ht vari- 
able after all selection cuts have been applied, including 
6-tagging, for tt Monte Carlo, and for the main back- 
grounds: W + heavy flavor, non-W QCD, and mistags. 
The W + heavy flavor distributions are taken from ALP- 
GEN Monte Carlo, but all other background shapes are 
estimated from data; the features visible at high Ht in 
the non-W QCD and mistags distributions are due to 
poor statistics in the control samples. 

The Ht distributions for the three major backgrounds 
are estimated using methods described in Section IVII 
The ALPGEN Monte Carlo generator is used to estimate 
the shape of the distribution for the W + heavy flavor 
background. The non-VF QCD background shape is eval- 
uated by selecting pretag events where the lepton is not 
isolated (isolation > 0.2), while all other kinematic cuts 
remain unchanged. This subsample is presumably domi- 
nated by QCD events with kinematic properties identical 
to the QCD background events that satisfy the event se- 
lection (isolation< 0.1). Each event in the sub-sample is 
then weighted by the total positive tagging rate measured 

The mistag back- 



from the jet sample (see Section lVI Bp 
ground shape is estimated from the pretag sample, where 
each event is weighted by the negative tag rate measured 
from the jet sample. Other backgrounds (which account 
for less than 10% of the total background) are included 
in the overall normalization, with the implicit assump- 
tion that their shape is not significantly different from 
the others. The ti contribution is normalized to the the- 
oretical cross section. 

Figure 1151 shows that signal and background can be 
separated by the use of the Ht variable. Figure ^] 
shows the signal over background ratio and cross sec- 
tion sensitivity as a function of an Ht cut, computed 
from figure The statistical sensitivity (S/\/S + B) is 
compared to the total sensitivity (S/ yS + B + (7(B) 2 , 
where <r{B) is the absolute systematic error on the back- 
ground estimate). Systematic uncertainties arising from 
the Ht cut itself are described in Section 11X1 they are 
small enough to be neglected in the optimization process. 
A cut requiring Ht > 200 GeV is found to be optimal: 
such a cut keeps 96% of the signal and rejects 39% of the 
background; this improves the signal over background ra- 
tio from 2 to 3 and the total significance on the tt cross 
section measurement by 6%. 
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FIG. 15: Distribution of the Ht variable for ti Monte Carlo, 
and for various backgrounds normalized to an integrated lu- 
minosity of 107 pb -1 . 
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FIG. 16: Estimate of S/B, statistical and total significance 
for 107 pb -1 integrated luminosity, as a function of Ht cut. 



B. Acceptance 

The acceptance is denned as the fraction of produced 
ti events that satisfy all trigger and selection criteria. 
It includes trigger efficiency, reconstruction efficiency, as 
well as the efficiency of the kinematic cuts, and of the 
6-tagging algorithm. The branching ratios of the various 
decay modes of the ti pair are included as well. 

The contributions to the acceptance are divided as fol- 
lows: 



€ft — e trig ' £zo ' e veto ' ^tt ' ^ 



lcp — id ' ^tag— event 



(4) 



< 60cm cut, e^ c 



where e ZQ is the efficiency of the \zq\ 
is the fraction of Monte Carlo ti events which pass all 
the selection cuts (except for 6-tagging) , e veto is the com- 
bined efficiency of the various vetoes (conversion removal, 



cosmic removal, dilepton and Z° rejections), etrig is the 
trigger efficiency for identifying high px leptons, and 
etag-evcnt is the efficiency to tag at least one jet in a 
ti event. fci op _id is a factor that corrects for the lep- 
ton identification efficiency difference between data and 
Monte Carlo. 

The efficiency of the zq cut is measured from data 
and found to be e Zo = 0.951 ± 0.005. The trigger ef- 
ficiency etrig is different for each type of lepton trigger: 
egfM = 0.9656 ± 0.0006, egg up = 0.887 ± 0.007, and 
egf* x = 0.954 ± 0.006. The factor fc lop _ id is evaluated 
by comparing a data sample of Z + jets events with a 
pythia Z sample, and found to be 1.00 for electrons, 0.95 
for CMUP muons, and 0.99 for CMX muons; because 
there are few Z + jets events at high jet multiplicity, we 
assign a 5% relative systematic uncertainty on fci ep -id- 

The efficiency e^ c is evaluated using a sample of 
pythia ti Monte Carlo events with top quark mass 
rn t = 175GeV/c 2 . Table fxTI summarizes the event se- 
lection acceptance for each type of lepton. The 6-tagging 
efficiency is measured from the same Monte Carlo sam- 
ple, and takes into account the ^-tagging scale factor 
(section llVfl by randomly keeping only 82% of all the 
tags, and discarding the others. We find the efficiency 
for tagging at least one jet in a ti event (after all other 
cuts have been applied, including Ht > 200 GeV) to be 
53.4±0.3(stat.)±3.2(syst.)%; the systematic uncertainty 
comes from the measurement of the 6-tagging scale fac- 
tor, described in Section HVI An important source of un- 
certainty comes from the measurement of the jet energy, 
which also affects the $J T and Ht measurements (energy 
scale): uncertainties relative to the ry-dependent detec- 
tor response, overall energy scale, non-linearity, fraction 
of energy lost outside the reconstructed jet cone, and 
multiple interactions are added in quadrature. Multiple 
interactions are soft interactions that can contribute to 
the jet energy measurement. They are not included in the 
simulation; instead, a small average correction is applied 
to each jet in the data. The energy of each jet is then 
shifted up and down in the MC by the uncertainty, and 
half of the difference in the acceptance (4.9%) is taken 
as a systematic uncertainty. The overall acceptance is 
3.84±0.03(sta t.)±0 .40(syst.)%, including all systematic 
effects. Table IXIII summarizes the dominant sources of 
uncertainty for the acceptance. 



IX. CROSS-SECTION FOR SINGLE- TAGGED 
EVENTS (> 1 B-TAGS) 

The production cross section follows from the accep- 
tance measurement and the background estimate: 



N, 



obs 



bk P 



(5) 



e« x £ 

where -ZV orjs and iVbkg are the number of total observed 
and background events, respectively, in the W + > 3 jet 
bins (see Table IXIIIfl ; e t j is the signal acceptance (see 
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TABLE XI: Summary table of the tt acceptance, for a top quark mass of 175 GeV/c 2 . 





CEM 


CMUP 


CMX 


Total 


Sample (total) 

# Events w/o b-tag 

Acc. w/o b-tag (%) 


o44,Z04 

15,893 
4.09±0.03±0.36 


OAA OfiA 
o44,Z04 

9791 
2.13±0.02±0.19 


OAA OKA 
o44,Z04 

3617 

0.959±0.016±0.085 


OAA 9£/1 
o44,Z04 

29301 
7.18±0.04±0.61 


# Tagged Events 
Tag Efficiency (%) 
Acc. with 6-tag (%) 
Integ. Lumi. (pb _1 ) 


8490 
53.4±0.4±3.2 
2.19±0.02±0.23 
162±10 


5202 
53.1±0.5±3.2 
1.14±0.01±0.12 
162±10 


1965 
54.3±0.8±3.3 
0.512±0.009±0.054 
150±9 


15657 
53.4±0.3±3.2 
3.84±0.03±0.40 



TABLE XII: Relative systematic uncertainties on the signal 
acceptance which are common to all lepton types. 



Quantity 



Relative error (%) 



Tracking Efficiency 
Energy Scale 
PDF 

ISR/FSR 
MC modelling 
Lepton ID 
fa-tagging 



0.5 
0.4 
4.9 
2.0 
2.6 
1.4 
5.0 
6.0 



Table 1X1(1 ; and C is the integrated luminosity. Many 
of the predicted backgrounds are based on the number 
of pretag data events, but that number includes a sig- 
nificant contribution from tt events. After subtracting 
this contribution from the pretag sample, the dependent 
backgrounds arc recalculated. The final background con- 
tributions for the single-tag selection are summarized in 
Table IXlTll and represented in Figure IT7I 

The properties of the selected candidate events are con- 
sistent with the expectations for tt pair production and 
background contributions. Figures IT51 and 1191 show the 
distribution of the event Ht and the tagged jet Et, and 
Figure 1201 shows the pseudo-cr of the tagged jets. 

For the optimized selection with the Ht requirement, 
and for a top quark mass m% — 175GeV/c 2 , 



5.6t}-j(stat.)+°;g(syst.)pb. 



(6) 



The systematic uncertainty is due to uncertainties on 
the signal acceptance (10% relative), luminosity measure- 
ment (6%), and background estimate (5%). The accep- 
tance, and therefore the measured cross section, changes 
with the top quark mass as shown in Table IXIVl 



X. CROSS CHECK USING DOUBLE- TAGGED 
EVENTS (> 2 fa- TAGS) 

Each tt event contains two energetic b quarks, mak- 
ing it likely that two jets in the event will be tagged. 
Of the 57 tagged events in the three and four jet bins 
before the Ht cut, 8 of these are double-tagged events. 
The double-tag sample provides a cleaner tt sample in 
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FIG. 17: Background and tt signal expectation (based on 
measured ti cross section) as a function of jet multiplicity. 
Events with three or more jets are required to have Ht > 
200 GeV. 



which to cross-check the cross section with a significantly 
smaller systematic uncertainty due to the background es- 
timate, although with decreased statistical precision. 



1. Double-Tag Backgrounds 

The background estimate for the double-tag selection 
uses the methods described in Section IVll except for a 
few additional issues which are specific to the double-tag 
estimate. 

The mistag estimate for double-tags is dominated by 
events with one real tag of a heavy flavor jet with the 
second tag coming from the mistag of an additional jet. 
The mistag estimate is obtained by applying the mistag 
matrix to the jets in the tagged sample, in contrast to the 
pretag sample used for the inclusive estimate. Since the 
tagged sample with three or more jets is dominated by tt 
events, applying the mistag matrix to the entire tagged 
sample gives an overestimate of the mistag background. 
Additionally, Wbb and Wcc events with two heavy fla- 
vor jets are not counted as part of the mistag estimate; 
rather, the mistag estimate is scaled by the fraction of 
inclusive tagged events which are from mistags, Wc, and 
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TABLE XIII: Background summary for the single-tag selection. The total backgrounds are given before and after the correction 
for tt events in the pretag VK+jets sample. 









Ht 


> GeV 


Ht 


> 200 GeV 


Jet multiplicity 


W + 1 jet 


W + 2 jets 


W + 3 jets 


W + > 4 jets 


W + 3 jets 


W + > 4 jets 


Pretag 


15314 


2448 


387 


107 


179 


91 


Mistags 


40.9 ± 6.1 


17.0 ± 2.4 


5.2 ±0.7 


2.6 ±0.4 


3.3 ±0.4 


2.3 ±0.3 


Wbb 


37.0 ± 11.2 


22.5 ± 6.5 


5.0 ±1.3 


1.6 ±0.5 


2.8 ±0.8 


1.4 ±0.4 


Woe 


13.7 ±3.4 


8.0 ± 2.2 


1.6 ±0.5 


0.6 ±0.2 


0.9 ±0.3 


0.5 ±0.2 


Wc 


34.5 ±9.0 


7.7 ±2.0 


1.4 ±0.4 


0.3 ±0.1 


0.7 ±0.2 


0.3 ±0.1 


WW /WZ/ZZ,Z -» tt 


2.2 ±0.4 


2.5 ±0.4 


0.6 ±0.1 


0.1 ±0.0 


0.3 ±0.1 


0.1 ±0.0 


non-W QCD 


24.3 ±3.5 


10.5 ±1.9 


3.4 ±0.7 


1.4 ±0.4 


1.7 ±0.4 


1.2 ±0.3 


single top 


2.6 ±0.3 


4.6 ±0.5 


1.1 ±0.1 


0.2 ±0.0 


0.8 ±0.1 


0.2 ±0.0 


Z+HF 


1.1 ±0.3 


0.6 ±0.2 


0.2 ±0.1 


0.10 ±0.05 


Total 


156.3 ± 19.1 


73.4 ±9.8 


18.5 ± 2.2 


6.9 ±0.9 


10.5 ±1.3 


6.0 ±0.8 


Corrected Total 


156.3 ± 19.1 


73.4 ±9.8 


23.1 ± 3.0 


13.5 ± 1.8 


Data 


160 


73 


29 


28 


21 


27 
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H T (GeV) Tagged jet Ef (GeV) 



FIG. 18: Ht distribution of the 57 tagged events with three 
or more jets, compared to the expected background and tt 
signal (normalized to the theoretical cross-section of 6.7 pb). 



TABLE XIV: Measured cross section for different top quark 
mass assumptions. 



m t (GeV/c 2 ) 


a (pb) 


170 
175 
180 


5.8+^(stat.)+^(syst.) 
5.6li?(stat.)+^(syst.) 
5.4±^(stat.)+«; 9 6 (syst.) 



iio-a-W QCD background. 

The largest background comes from Wbb events, where 
both b quark jets are tagged. This background estimate 
uses the same heavy flavor fractions and tagging efficien- 
cies given in Section [3 and is normalized to the pretag 
sample of W + jet events. There is a small additional 
contribution of double-tags in Wbb events where the sec- 
ond tag is from mistags of light quark jets, so the mistag 
matrix is applied to light quark jets in the Monte Carlo 



FIG. 19: Et distribution of the tagged jets in the 48 candidate 
events with three or more jets and Ht > 200 GeV, compared 
to the expected background and tt signal (normalized to the 
theoretical cross-section of 6.7 pb). 



sample to account for this additional contribution to the 
Wbb event double-tag efficiency. The Wcc background is 
treated in the same way as Wbb. 

The non-W QCD background estimate uses the same 
lepton isolation and $ T sideband regions described in 
Section lVl Al to estimate the double-tag background from 
direct production of heavy flavor jets. There are zero 
double-tagged events with two or more jets in region B 
(2soi < 0.1 and $ T < 15 GeV) compared to 133 single- 
tagged events, implying a Poisson upper limit of 1.8% at 
90% C.L. for a single-tagged QCD event to be double- 
tagged. This is applied to the inclusive tag QCD back- 
ground estimate given in Table IXIIII and the limit is 
quoted as the uncertainty on the background estimate of 
zero double-tagged QCD events. 

The double-tag backgrounds for the electroweak and 
single top processes follow directly from the same Monte 
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TABLE XV: Prediction for the number of double-tagged 
events. Corrected total comes from the ti cross section mea- 
surement where the pretag sample is corrected for the ti con- 
tribution. The expected number of ti events is calculated 
using the measured cross section of 5.0 pb. 



Jet multiplicity 


2 jets 


3 jets 


> 4 jets 


Single top 


0.40± 0.08 


0.15± 0.03 


0.04± 0.01 


WZ 


0.15± 0.04 


0.02± 0.01 


0.01± 0.01 


Wbb 


2.76± 0.86 


0.64± 0.18 


0.21± 0.06 


Woe 


0.20± 0.08 


0.05± 0.02 


0.03± 0.01 


Mistag/QCD 


0.14 ± 0.04 


0.16 ± 0.04 


0.11 ± 0.03 


Total 


3.65 ± 0.97 


1.02 ± 0.23 


0.40 ± 0.09 


Corrected Total 


3.6± 1.0 


1.3 ± 0.3 


tt (5.0 pb) 


1.0 


2.6 


4.1 


Data 


8 


3 


5 




0.2 0.25 0.3 
Pseudo-cc (cm) 



FIG. 20: Transverse decay length significance and pseudo- 
cr distribution of the tagged jets in the 48 candidate events 
with three or more jets and Ht > 200 GeV, compared to the 
expected background and ti signal (normalized to the theo- 
retical cross-section of 6.7 pb). 



The total double-tag background estimate is given in 
Table IXVl The cross section for the double-tagged sam- 
ple is measured using the events in the three and four jet 
bins as for the inclusive tagged sample, and correcting the 
pretag sample for the ti contribution from double-tags. 
Eight double-tag events are observed on a background of 
1.3 events, implying a cross section of 



eft* = 5.0+f;g(stat)^ ( l ) ;g(syst)pb 



(7) 



The systematic error is due to the following contribu- 
tions: tagging efficiency (15%), acceptance (7%), lumi- 
nosity (6%) and backgrounds (5%). 

This result gives a consistent cross section measure- 
ment in an almost background-free sample. With a larger 
data sample, this double-tag selection may offer an im- 
proved measurement of the ti cross section. In addition, 
the double-tagged sample may prove useful in estimating 
relative contributions of the different W + jets produc- 
tion diagrams, especially gluon splitting to heavy flavor 
quark pairs. 



Carlo calculations discussed in Section lYl Dl The only 
significant contributions come from the s-channel single 
top process and WZ with Z — > bb. The total double-tag 
background estimate is given in Table IXVl 

2. Double-Tag Acceptance and Cross Section 

For the double-tag analysis the backgrounds are suf- 
ficiently low that we do not apply a cut on Ht- The 
pretag acceptance uses the same MC sample and lepton 
identification and trigger efficiency corrections described 
in Section llXl The pretag efficiencies are 4.32 ± 0.35% 
for CEM electron, 2.24 ± 0.22%_for CMUP muon and 
1.01 ± 0.13% for CMX muon tt events with three or 
more jets. The efficiency to double-tag ti events with 
three or more jets is measured from Monte Carlo to be 
0.11 ± 0.02 after correcting for the difference in tagging 
efficiency between data and Monte Carlo. 



XI. CONCLUSIONS 

The ti production cross section has been measured 
with vertex-tagged lepton + jets events from 162 pb -1 
of data collected at y/s — 1.96 TeV. The selection yields 
a sample of 48 candidate events with one lepton, large 
missing transverse energy, large total transverse energy 
H T , and three or more jets, where at least one jet has 
a displaced secondary vertex tag. A total of 13.5 ± 1.8 
events are expected from non-tt processes. The measured 
production cross section, assuming a top quark mass of 
175GeV/c 2 , is 

a(pp-^ ti) = 5.6±i[;i(stat.)+gg(syst.) pb. 

Applying the same selection, except for the large trans- 
verse energy requirement, to a double-tagged sam- 
ple (8 observed events with expected background 



20 



of 1.3 ± 0.3 events) yields a cross section of 
5.0ti;g(stat.)^ ' g (syst.) pb. Both results are consistent 
with the theoretical predictions of 6.71q'q pb, again as- 
suming m t = 175GeV/c 2 0,0. 

APPENDIX: DERIVATION OF FORMULAE FOR 
DOUBLE- TAG METHOD OF DETERMINING 
EFFICIENCY SCALE FACTOR 

The measurement of the tagging efficiency in data em- 
ploys the double-tag method and uses identified conver- 
sions to estimate the contribution of electrons which are 
fakes or part of a conversion pair. This appendix sum- 
marizes the detailed calculation of the tagging efficiency. 

Most of the electrons in the inclusive electron data 
sample (electron Er > 8 GeV with no ^requirement) 
are from conversions or fakes in light flavor jets. Away jet 
tagging enhances the heavy flavor fraction in the electron 
side, but it still needs a significant light flavor correction. 
In general, the heavy flavor production in the jets comes 
from three subprocesses: direct production, flavor exci- 
tation and gluon splitting. For simplicity, the final data 
sample can be divided into the following four subclasses: 

• Nhh- the number of events where both sides con- 
tain a heavy quark, either c or b (it includes the 
contributions of gluon splitting in both sides), 

• Nhl ■ the number of events where the electron side 
is heavy flavor and the away side is light flavor, 

• Nlh'- the number of events where the electron is 
coming from fakes and conversions and the away 
side contains heavy flavor, 

• Nll- the number of events where both sides are 
light flavors. 

By construction, we have 

Nhh + N HL + N LH + N LL = N 

where N is the total number of events passing the final 
selection. The heavy flavor contributions in the electron 
side can be determined using the measurement of heavy 
flavor fraction (see Section Hvjl . 

Nhh + N HL = F HF ■ N 

The Nlh contribution can be determined using the 
away tags in the conversion electron sample. Finally, the 
contribution of Nll is estimated using the mistags in the 
negative side. 

Let us use the following notation to help the derivation 
of efficiency measurement. 

• e' H : tagging efficiency of heavy flavor in the electron 
jet, 



• en- tagging efficiency of heavy flavor in the away 
jet, 

• e' L : mistag efficiency in the electron jets, 

• 6l- mistag efficiency in the away jets, 

• N a+ , N a -, N e+ and 2V e _: are the number of posi- 
tive, negative tags in the away jets and in the elec- 
tron jets, 

• NlX, K+i K- and K- : are the number of 
double-tags in the combination of positive or neg- 
ative tags in electron jet when the away tag is 
present, either positive or negative. 

Applying the tag in the away jets, the numbers of pos- 
itive and negative tags are: 

tH-N HH + e L -N HL + e H -N LH + e L -N LL =N a+ (A.l) 

€L-N H H + € L -N HL + e L -N L H + e L -N L L=N a -. (A.2) 

By subtracting Equation IA.2I from Equation IA.1I , we 
get 

{e H - e L ) ■ (N HH + N LH ) = AW - N a — (A.3) 

Applying the second tag on the electron side, the num- 
bers of double-tags are 

4 • (e H ~ e L ) ■ Nhh + e' L ■ (e H - e L ) ■ N LH = N°+ - N c a t 

(A.4) 

4 ■ (<tt - e L ) ■ N HH + e' L ■ (e H - e L ) ■ N LH = - N L a Z . 

(A.5) 

Subtracting Equation IA. 51 from Equation IA.41 we get 

(e' H -e' L )-(e H -e L )-N HH - (A r a e +-A r a e +)-(A r a e l-A r a e l). 

(A.6) 

From Equation IA. 31 we get 

{e H -e L )-N H H = (N a+ -N a _)-(e H -e L )-N LH (A.7) 

Substituting Equation I A. 71 into Equation IA.6I and re- 
arranging terms, the heavy flavor tagging efficiency on 
the electron jet is 

, _ , (K+ - N£) - (N£ - NZI) 
(H 6L (N a+ - N a _) - (e H - e L ) ■ N lh 

In order to determine Nlh, we select the events where 
the electron is identified as a conversion partner and the 
away side is tagged. The heavy flavor contribution in 
the away jets should not depend on whether the electron 
originated from a photon conversion or a fake. 

Let us denote the following quantities 

• /: the fraction of electrons originating from con- 
versions in no- heavy flavor jets in the electron side, 
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• /': the fraction of electrons originating from con- of conversion, which is determined using the same 
versions in heavy flavor jets in the electron side, sign, 

• e c : the efficiency of the conversion finding algo- 
rithm, # n c. -(.^g numDer of identified conversion electrons. 

• e°: the error rate of finding a real electron as a part 

I 



Applying the conversion finding algorithm to the data sample, the number of conversion electrons is 

(/' • e c + (1 - /') • e°) • (N HH + N HL ) + (f-e c + (l-f)- e°) • (N LH + N LL ) = n c (A.8) 
/ • (e c - O ■ {Nlh + N LL ) = n c - N ■ e° - f ■ (e c - e°) ■ (N HH + N HL ). (A.9) 

By looking for conversions in the tagged electron jets, we have 

(4 - e' L ) ■ (e c ■ /' + e° • (1 - /')) • (N HH + N HL ) = n c e+ - n c e _. 

Since (e' H - e' L ) ■ (N HH + N HL ) = N e+ - N e _ , we get 

e° + f ■ (e c - e°) = e c ' (A.fO) 

where < r 



(7V C+ -Ar c _)- 

Substituting Equation I A . f 01 into Equation I A. 81 . we have 



f . (£C _ ^ = nV^-^ + ^- O-^) (A . n) 

1 - -FffF 



We apply the tag to the electron jet in Equation lA.8l and the excess of tags is 

/' ■ (e c - e°) • (4 - e' L ) • (iV ffff + N HL ) = n c e+ - n%_ - (N e+ - N e -) ■ e°. (A.12) 
If we then apply the tag to the away jet in Eouation lA.8l the excess of tags is 

/ • (e c - e°) • (e H - e L ) ■ N LH = < + - n c a _ - (N a+ - iV a _) ■ e° - f ■ (e c - e°) • (e ff - e L ) ■ N HH . (A.13) 
Substituting Equations I A. 71 and I A. 101 into Equation IA. 131 
/ • (e c - e°) • (e H - e L ) ■ N LH = n c a+ - n c a _ ~ (N a+ - A a _) • e° - (e c ' - e°) • ((JV„+ - A a _) - (e H - e L ) ■ N LH ) (A.14) 



From Equation IA. 1 II and Equation IA. 141 we get 
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(e H -e L )-N LH = (N a+ -N a _)- 



n c /N - e c ' 



Finally, the efficiency can be expressed as 



where 



(N a+ — N a -) ■ F° F 



a _ jVa+-Wq- 

tHF - 1 n°/N-e< 



-■(l-F J 



HF J 
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